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ABSTRACT 

A cellular array is an iterative array of identical information processing 
machines, cells. The arrays discussed are rectangular arrays of programmable 
logic, in which information stored in a working cell tells the cell how to behave. No 
signal line connects more than a few cells. A loading mecharMsm in each cell allows 
a computer directly connected to one cell to load any good cell that is not walled 
off by flawed cells. A loading arm is grown by programming cells to form a path 
that carries loading informatioa Cell mechanisms allow a computer to monitor the 
growth of a loading arm, and to change the arm's routet to avoid faulty colls. 
Properly programmed cells carry test signals between a tested cell and a testing 
computer directly connected to only a few cells. The computer may discover the 
faulty cells in an array; and repair the array by loading the array's good cells to 
embed a desired machine. 

Terminology and network models are developed to describe the 
characteristics of a machine that are imporUnt to the tfst and repair of an srray 
embedding that machine. Important machine classes are defined, and their test and 
repair requirements are compared. Computer simulations of repair aid this 
comparisoa 

Each machine class is represented by a particular ceUular machine 
design. Arrays are presented for realfzing highly-integrated, computer-maintained 
memories, such as variable-length shift-registers, rmdm^§cc9SB memories, and 
track-addressed sequential-access memories. One flawed array of simple cells 
may perform like any digital machine, within limits set 1:^ Ute size of the array. Its 
number of input-output leads, and the speed of its components. One such machine 
can test, configure, and repair its cellular environment. Applications for these 
cellular arrays are discussed. 

The thesis' approach is oriented toward the realities and trends in 
large-scale integrated circuit production; and has potential integration level, 
reliability, maintainability, and flexibility advantac^s. 

THESIS SUPERVISOR: Edward Fredkin 

TITLE: Professor of Electrical Engineering and Computer Science 
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CHAPTER 1; OVERVIEW 

Section 1.0: Introduction 

A cellular array is an iterative array of identic^ information processing 
machines, cells. Test of an array discovers its flawed cells. Configuration of an 
array programs it to behave lii<e some machine. Repair of an array programs it to 
behave lilce a desired machine in spite of faulty array cells. TNs thesis develops a 
practical systems approach to highly integrated, computer-maintained cellular 
machines. The structural simplicity of cellular machines gives them many 
advantages, especially now when large-scale integrated circuits (LSI) are 
proliferating. We specify cell mechanisms and outline associated SLHF>port programs 
for an arbitrarily large, two-dimensional, rectangular array. While we focus on 
two-dimensional rectangular arrays, our approach has obvious extensions to arrays 
with different interconnection geometries and more cfiroensions. TNs approach 
allows a digital machine to electronically test, configure, and repair an array by 
direct communication with only a few cells in the array. The fact that a computer 
can test and repair an array implies that the array need not be perfect All the 
cells of the array may be simultaneously produced as a very large, integrated 
array device. Such a device usually has faulty cells. After the array is first 
fabricated, a computer can find the defective cells in the array and load a perfect 
machine, which incorporates only good cells, into the flawed array. Thus the same 
mass-produced device may be program-customized by a mass-produced device, 
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the computer, to toehave Hk« a desired maehine. If tNt amy^-emtiedded rtMeNne 
develops a mm Hew during its operation, and if this flaw causes a noticed 
performance dtfTM^Mion, the array mny be partly or coMiite^ re-tested ihd re- 
customized % mtnmm^ Thus an mt^^-mb i ^tkm mi^^ be etectronicaHy 
tested, and Impaired lo^iAeiNlporito «fiy fdedciis.^^ ineansl^t thai array may 
be maintainei «y « dVM RMcHNm mmmmm tifi ifray miy be ri*«ilitofMzed 
et any tiffM. this «ppreiidi is UAored le the rMlffiei M^ J^^USa in design, 
raenufeetyre^ dto^«Bti««i^ mii nmSmmim tif mt^lt i^^ 
corapeaed of ig w i up cw e nts. 

We diaeast arrays of "p n og r a w wabl e tegi«f , ^vheiit lofiarnfation loeded 
into memefy etamaiila in ti^mKA^fmWk^mmMMUmii^ line 

cenrm^effiM^veHt. A tooding ntMMMii {iMitt|»ai^ a«^ 
««» eRows « eomp^ler i«tiic8r eoMii^d l^'ofliy laiyi ieH^fo «aid iry ^»od oeH 
»wt is 11^ wiNed «ir by Itawed ce^ Ifm^ithimmmmky^mitm computer 
sends to the array may aeitict one of i tarie set of 0M» $tm for i toeding 
erm that carriea iMding information to e etfl/ m di<i^eto# «itf «ned)ahlinis that 
aHow the computer td iiidnilor ^ growth (rf a lead^ ar^ ti a eiM, and to change 
the route <^ the mm to evoM fatiMy eelNi: A mithed Is deao^aed for testing cells 
inanarrayby uib}gaiestmad«nedb'««%%ifiiie^1^ the 

array. f»ropi% pregni B wia d ciiis carry fait sighaii1>^weirt swiie newly t^ed 
ceH and the test miBhine. A loadMg arm nipy Ibe used # v«y thi stito of the 
tested c^. 
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Programming an array to behave like a given macKifie if called 
embeckfing that macNne. When a macNne t9 embedded in an array, it should not 
allow faulty cells to affect its behavior. Therefore an emtp(td«d macJti|ne it 
programmed to ignore sigrwts sent from fauijty calls, We find thet the 
communication paths required betwaien tht etieappl cetif of an eiiil»edde^ inachine 
affect test and repair of an array for embedding that macNne. Oevelopnwnt of 
terminology and networic models allows us to describe,.embidded macbinea mora 
precisely. Important embedded macNne tAt^mif are ^ne4 and thair aaeo^atad 
test and repair requirements are detailed. Computer sMwilatieM of rapalr 
facilitate tNs con^wisoa 

For each class of macl^na Ihat'f #iii^|^ iftr^fr, piliollalty 
useful representative of th^ class is d^ta^ AK airqp* oetts conte^ our lofding 
mechanism. Arrays are presented for rjiiHz^ |ii|^,;,|e^^«iteid,. COfllPl^er- 
maintained roernories. These include vri^^ for i^|||Ui|iii;if yif iafe»l»*4ength eMf t- 
registers, random-access memories, and tritckridcb'essiMl se^quei^il^^ac^aaa 
memories. One array of simple cells mey Iq« pci^inpiiii^; to^ afl#afi an apbiirary 
digital machine, within limits set by thf size of the wr4|y» iU fHirab«r of ^^^k^ 
output leads, and the speed of its components. An array-Mobedded computer can 
test, configLrei and repair its celli^ar environment ipnt techNques we develop. 
Indeed, two or more irri^-eralatiMdcfed f^rnp^ 
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SKtion IJ: Ayriys /^€«ab«ddadiyheNnM 

A nmm di^i^ iiic^^^ 1^ our aj^^Mii rMMrvt Mraduetton of 
sornii k«y tomMi 

A tmii» mwf Wm wmv ^ liii^toMl^ldifiei inlttrmi^ precMtinf 
maehinm, attst mmmmt^ in ■ir^ri^ m^. llai MUm n -^im n iA ot ) d 
aiTay ocoipiM » tlH3eo i^oifit In Ml ft-^Mi«#toy apiS^ eeH eommunicotm 

diTBdfyitmmm midiMi through • mk Mi^ a^iliioi. Figaro 1.1 ohowo 
i poftil9fo iiyai^ itf^o eilitiir arri^i^ Eidi ei^ 

of tiiiMl ^mmtsm, ^§m corroipMiinf ^ polM^ #ici iotwnuhicotiori with 
anothw' coil, a nttg*k>r. If any mamlMr of a titfa-fof imiiiell to a no^Niwr, all 
m^ntert of ^)o t id O' a ir cowioct folhat iM^^NborJ^lfii Miii^ <looari*t connoct to 
a noifhbor, tonworiH # itt ni«MN#« i# l:<irM^ 
UntOAAoetod ^Kilt o^ at if tl^ afi ccihitilitf fo 1i Mfiary 0; tliia it aatiiy 
ifnpiwnantid Wo «OfitNMtlrato wi iHtdmrn^i eiiuiar arN^ Wo-dimdn^onal 
arrayt lika ttuft iHiBM^i in flfuro l.t/«rtioi« iaett oalf >^ 
and milpiit ^moi^ m& tA «ii^ nMt^w^ Wi %•• tfto fiM# 1^«oeiirb<b#r to 
iuggott an obviout ^yout for our amyi, vMtK iaeh ilioiiirioilt t(|ior« tending 
for 1WI idor^teai i»ll. $inm havo prof>9«M< arri^i Iff «i«i^ tignal buttat run 
tttrough many caMt. Sine* an array it mmtmm0^lmM^^ttm a tifnal 
but It iiam aga d, vmrmf^ihtkmm, ba no 1111 iWltln efiotikirboard irray^ 
at mott, a tignal Una connoett a eaii to itt four noifhbort. Chodtarboard arraya 
ara woli-tuitad to tha ttap-and^apart natura of eurrant ifitar«M efo«ult (lO 
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Fig. 1.1 Layout Of A Checkerboard Array Connected 
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Key: An arrow indicate$ one or more of a mapNne*$ inputs gr jMltputs. >|iith th« 
ii¥Ovi^:dfr*eU(:^ fN^ny "tfte dlrtttton ef mfMM^fiFile^;MlS^-M boxM" 
represent extra-array Machines. The smali, unlettf;ed ^OK9s,t|P!?*'f99'ltC!iii9>, 



fig. 1.2 |f^rcei«¥ioMi>n Network for. CiieekeUseaN 4Nm»<^FIfur» 14 
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production. T^fs* arrayt ^mmm^m^t^mmtm^ and r^r advantages. 

An sitmcMtfmtfm nttwork, Mfh m that of |ftjra 1.2, partially datcribM 
an arra/s la^f^Mt by MM^r^iMlia^Mir^ with ita eailtiar 

n«tghi»orf or HKtra-arri^ ii|a^iinat. In # ifltafeewiKlon nflwork, Mch nod* 
reproamta a cftt, and *«^ (janofid ra^a4^a an a^ra-lrri^ machina. A nodo ia 
link«i to anotNpr nwla or difaend M «g dnty^ ona^ fLJiiiL^pial Hnaa dirpctly 
link tho aaMidalitgl raaeWnM, ; A noda imi^p » If n ltoMti»Kn# to tha nodi 

It's nlwfcaut thk^ aM N a rrayi fN|»^P^ i»t*M ^ ani>y^ -®ewfC|aantly, 
it's valid to iwlrta th* ae1h% in a tub#r|^ at ar«v^^ly, imd trMt liia mHo 
outaida 9tfi iMTir at sKtM-Cfut^aii^ ipp^ 4 

Wo loeui en poframm^t l$pe oN8|arlMird arri^ wbaro aach call 
containt fMmm-fmfmtm ttm ms atfaetteig wNieti of tav«rial iparationa the cfll 
porformt. Bo^ cattt and arr^ »• viewad at having two functional layort - a 
loadtTig layn and a tffMSiing Utytr - \Miti) fpjstMif^i/li^Ai^ (^;||^ aiK» fpuHHOry 
eiamantt for Mch foyor. Of course, these layers may be piiy#cdHy intertwined 
At any ffven |N«s instant, only one of a celt*s liters is aetiv^li(i The processing 
layer of m amnf 4eiJs«d la mi>*^^^lm-Mm^hm^mmi^mm^if% ilwaeilitely 
useful to an array's user. The proeesi^ layer's ou^Mit and strte are a function 
of the proeessmg liter's input and state, |i|lt| He funemn statt - the state of the 
function-specification state bits. The funpU«}-tpecifi(»Mon state WU roey enter e 
particular function stfte whan an array is powered em after this, ttt^ miy only 
be loaded throuf^ tm of the arr^s loac^ inputs. The sole function ef the 
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loading \tm it to M tipt ^U, fn4l^^|||f,j|f^ ||p.^ri«imiiw1ormtd in tha 
processing ityor of tho trrfy, ThiJt |p |ppt^i|»o|HfttitndE»il« act at 
intarmaditrias batwaan tN io>#i iijif .^it^jgntii^ 
spacification bits ara tha only ciMl laflc ||iifii«|itiil,jti«#o|^M aittiap. layar. 
Typical usa of an arwiy in¥0lya9 MMK^ffi* lMn^lai|^||ij^^Mi9n ft^i^ l>its, 
thw-ak^ spadf)^ sorat fMQction. t^ I^BI^ l ) gr,|>nj»M(» lliaii^ tha 

loadar is qui^cant w^ajha {liocatrini jfifi P^mm^ ^l«*^ lla^Mia of tha 
loadw mfy ro-pri^am tha ||r(K^a«^j|^||||ra!^ fi!ii||,f|ip fMnG*fe»r^ 

For many aetipi orriiy a|ii^|^ |iNrt4i»«^^lfM,^^v^^ of 
an arr^ to ramain flxfd «tigp|* gi^ |^^ 

•^^t <«5^!« t»»§t fna|r|j|of 1^ fP|Hllfi|ti>lMiaf^t^ fMia 

P^oc^»« input-oi%ift ^ and Jp^gl^ ^pmfl»#f MMa «?ii* mould 
justifiaWy think of his »rM^:^MtJ^mim!l^^^ appdi w d ^ ii» tha 

procM^ng l^ary wij|^tt^fixad#t|^h||^^ 

machina. Similarl)^ a usar mi^t o{|| ^f ^.m^^^^^tti^ ififMita dM»^ an 
interval davot^ to loaing funs^on-«|fg|^^ 

think of tha array as an anyirofwa«a f%J|,/tt^ filp^^^l^HMMini layer 
during tN« Intai^^ flfpily^ 4^4ifpr oiig^M^^ 

loading in^tv an^ |fl^t|f», iJiiii||^^ aplnlMlval. 

Tha user 5oulfl tNnk of an iii^?adda4 bm^^^ a%«^i|»yifgH|»tfelaadtei©and 
processing laypt during 4hf:Jr^aryaU Jkhm iil|h|jiiipaafe *• natiir^efr the 
emiMddad machina profoundly affacU ^ l^miB^m^ Sm t kimMt^ M 
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ICfidw(«(lg0 tl tto ciMHi^alntt dnlMi ri#y a#iri| an intWval aff«cU 
t«stto)g «nd rtiMtfr ti pt^tttoiimly »i^ w* dtmiipr a MM^ to (iiaerRM Oim« 
cdfwtralrAs. ^ iwilKilr #MiMiMi i^^ ait lrt# dii^ a givwi tinw 

tntoPMft Hirn Ruling an Irvlttmd; ««•«• ari ^ %^riiftlfi iliafl^^ t»w inpiit- 
oit^ut («•#• ^mwm that connaettd to a uair*l nficMha mifht foe th« 
int«ra«tifig Ktiai ftM' li^ tiaar. SfmRarty, funeftan-apMHieation bits which would 
affact ^ luiifelfoii art #riy pwlorwad f or a utw niigM ba intaf^ting, whH« 
ftine»on>^»a^i«iMeir «ills ^ a rafnote Midiinif in aiti^ ni^ b« unintwMting. 
Thui^torMtiaili«riadlnteriMoratitir*tiftftindM^a^^ Tha itata of th« 
mraif it «)a teiNtffig of tha Iniin^, and tha giwmt* It imf rooaiva dur^l tha 
if^orval, a^iet wHKfr (^ % m«Rory i»atnantt and iiia-ait tinat ara rWrntnt; that 
isi whteh fiMy ^K!f ftttarMmnt alamanti during tha intorvai. An mb$d<Ud machnu 
ior a i^van am^iitiaierflMcf fc)y a Ittt of thOM ratavaht mtmdry alwnanta and 
rida-^t in^ Nhai who«i vaTaat ara Imown to ba fixad during tha intarval, and 
Ihttir atsodaiii vahiat. Tha ambiddad mathtna*! ihptit-oi^t iin*t ar« th« 
r»»a^«wt input-OuljMA «nat of tha ari-ajr tfiat iif« variabia di^ intarval. A 
prograNwiabte togle toidiflg niaehantiffi ntay tat Yuh^ti^^ ttata bits, and 

thwraby m^^mmmmfmS^fm^^^ l»Nx:Maing 

iayar ^ an array Rgfire 1.3 givat a charaetirliatien of oria aueh ambwidMi 
raachin* irratevant ^fut* and outputs ara ntk thown^ alM^ Ihiy doHl affact th* 
parformaHc» of «i» i ftb ad ded waeNna. 
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Fig. 1.3 Machine Embedded In A Processing Layer 
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Diff«hirtt erobMlded tmMnH may b« aquivalwit If a flawwi array it 
confifurtd to iffibtd a machint that it acMviltnt to • jfrffet array*8 wnlMdctod 
macNna, wa iiy thft fiawad array h^iMm^f^p^tHdUmktdii pmlmA macMrM. 
Our array rap#f it ttitt'ofara an Mf^muAofi p'veiMViMilt^lAltwi of 6«N% and 
not a mechanieitf aittfatton of an array. 

A reieoam conntttten neimrk, w nlcori netvorkth a aubnotwork of thai 
interconnMstion netivork that d«terii>a« iomnuiiiMrtkxf ki an ontbMided macMno 
(sea figura 1.% Aft in tho intwe|innaetton noftMerK dots oorrptpend to ceHa, and 
diamoncte corroaiMnd to axtra-army maghinat. ^ and on^ ¥ ^ JMti one rolovant 
connection direetlytiniii a 4»UwUh another ^«re}^i«p«a*f ay machine, a link 
connects the eiN*t dot to the apfrepriMed^ or diiRiend In th«rek»nnetwori^ A 
cell'a rHcm n«gUms are the ontitiet * cella or «ctra*array machinea - whoae 
repretentativea ar* direoUy miid to «i# i^i»^ dcTin Ifi >itf y^Mrerk. A ceH 
with n refcon neighbors is a call of rekon dtgttt n, caHed a rtlcon-n celi. An 
embedded machine's reteon is the Nghest ratoan^igree of any i^ Its ceUs. 

A qyiN«cation of our defNtion of an embedded mmm makes it more 

consistent with our IntuUve understandif^ of a reacNne; We ttiiJirm that tvvo 

cells in the same eRy»edded machine be connected by some path of relcon 

neighbors; that is, an embedded miehine*s reicen network must have some path 

between their represtfrtiMve dots. Thus two or more macMnes may be embedded 
in the swM arr^. 

An array constrains the reteon network of machines embedded in that 



T 1 ^^..^,jjyi. ^1 
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array. Two cailt can ba ralcon p^^0^j^J ^ji>pjitg^^^ in.^.lipra 
interconnaction network. Furtharmora, tha tat ^ a9(Bwa#joi||^ tJ^tat may furthar 
conttrain rdcon natwork.. Ajthwj)^ all.thf arr^^^j^wa |>r^^ \^y^9 a 
chMkarboard intarconn«etjon nptwork^Jha,^^^ 
associatad procaating-|ayar-amba<idad macMiMM varJM. An nrrMy ia i^nly 
intarasting if this maximum rakan daraa it at, laaal 2t^|jbarwita.cal}a jn an 
ambaddad machina don*t commMnicala with facti othar. 

An array ytn% fla)i|ra<calH( ^W^M^ cq^itnil^ tha ralcan natworfc of ita 
mnlM M tii imcHnM. ki nfMnf mwrm, ^mlff^ii it Hfond •rray. p,iMC|iliw 
•qUvalwt to m ipiidNKl fpr . pwtet «ny^,J«g,^^^i4 m«|,im,pnet 

.^^/^ I'^for^ ^g^ ■? '^^M^ fW p v^ ' ji^i*^! tf** 
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iS^Udft 1.^: tt#Loi(fBr And Re^«t«dC6n^ 

fmmsmiMm ^% i^MiSM to mtt of our imlkig, conftfuri^on^ 
artcf r60«rr tt0fif«tf^fti. In a frititfiiiiMon tiitti, oimm if^Nlt to t etti art 
tronimrifod uiK9ttlii«l m oulpiitt iHor o el^oy; ^ (t, tlto d«R icti iH(» on* or 
more wfroi cofirfiilbnf on rnput fo an output Tha daHt wa dlaeyM all hava inputa 
I and outM* d tittt eai^ la clakrfbad by iha^loitwi^ 

iU ffrst i»hm^ - U, R, b, or l - danotat ibna of a caiPs four oida-aata - U|9, 
Right, Down, oi* Lift fatit tide-set of a i^eri edl hat the tama number of loading 
inputs arid oulpuH M; aikf the same nxttmr of proeaas^ ihpiiii and outputs, H-U. 
For ^r I s K i Ni fTs true (hat U )t(fy Idk, W 6uk. 6r» db* and Olk ara assodatad, 
and given the tame name, thus wis migftt spmli of tfia Salact leadhg input and 
output of each of a eeil's ride-sets. In a /etbtoti tramsmsstm. smti each loading lina 
of one side-set isi trammittad to an assodatad loading output at one other aide- 
set after a difiy not longer than about one gati-detay. Thui a loading 
transmission state busses the loading input* of one sida^sat to assodatad loading 
outputs at anotiiMW' side-set Processing trtmmisston statu effeetivirfy dmnect a bus 
to every side-set'a processing outputs. Eadi bus connects the processing outputs 
of a dda-sat to tha aaiedatad proeaating Inputs of any ena of tha ealfa akia-aata. 
Transmission Unks are Important to our testing and repair processes in 
sortie arrays. A transrHilssion link is a procasaini layar's chain of cella in 



5»^V4**'>-"'*»»'-'*' 'V . !5yS^«{p*3f ?A,''j**wiR><I »'* * 
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transmission ststas thst acU in s "^iM^^iM imi.Wik^iik$%WsS^vi9^iswi^ 
input at on* of its onds to an assodatad prt^^Ssbif output at Its epposit* and A 
transmission link parforms tha sama bussing fimeOon In an array Indapandant of tha 
lira's path or l«^^ 

Our loading approach usas catls in loadbig fammitopion ilatM to transmit 

sofflo ralcomS and f»lcei»*»nisihtoias^ |i'sil i i i iiilitf llli^ signals from a 

tast macNna, such as a computar, to a tai t a d cJM . Tliasa sama Hnks concurrantly 
ratum a tas^ twITf i%i|Mnto baidc t#1»# tastad in tNa 

way ara rapairad by linking dustars of good««ll#Ma transmission links. 

An iiiiba(|dad m;mm9mMM^f^il^^ milgbbors (soa figura 

1.5). Tha antfs ii^vha»raicon-4, awil^ll^ttiar ^aMs-4i» th a aw i ; hava ralcon-Z Th« 
raicoi^2 calls ara tha arm's 6M(y. Th«»^«^ fif trmttJCfi^^ from ih« 

tip in tha ralcoh network's chainJ A loadbi| a|n is U84d|WlMd 0m^ calla in an 
array; loading signais flow from 1^ loaiiig bn#s basa to its|tip» whara a caH la 
loadad Wa discuss maNnas whic|i ara aafHy afbaddo0^|ar|w In tha procaaaing 
liQror f>f arrMtcNjia. 

Wa davalop a iosdif^ machanismi that cMi bai couplad with any 
progri^nmiMa logic procassing mochanism in an ^ array erf two IN' mora di m a naions . 
TNs loading maehartwn attews m^iiiimiWW^m^lis% parfaet» arbitrarily larga 
array by signals input to oni ^ anywhira fn ^irri^ '^HH ii^possibla baeausa 
a loading arm may ba grown to tha loattoiit^l. Tha loading arm Is an arm 
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Fig. 1.5 Relcon Itetwork For Two EmlM«i«d Arms 
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Fig. 1.6 Relation ivtwften Esdontiai Nttiwork And^B»o(;iM|4fR«lcc>^ 
8) li«(^9R nftwcH'k for ffsjrt 1 J^ ip|«K^ 
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D) Rttieon network for the tmbackiid aacWne in C 
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*'P". ■: I*^..**^l* :'Wi*Wfii'i ^^HiW.^* 9mf*m^ olw^ 'mt^-i^imi^mvdon^ 

the c^ tN:ou0 th|3i^y|,#§^ 

st^es. If U^ee JpfK^,ir||^^i|piif^,|(^ how the 

ueirs i9t^»r tyJiHM^ci^l^^^fi^ J>»|^ -ipi^fnlaifi |4iii8fti| rpiifi i iiti en 

ftet^in viMiJt^fr\|fi%^t|Mli|^^ ^mm»^m0l^ 

••ti-^^ThJitjIi, it i!^,bp»fij|)i^ ^i^^gifglf^it^/itgifiMm^m^ 

tip ceM, Jhu« % celij iip| |f«c(#cfe ii|^^,^j|i||i|^»ig«i^f^^ Itt 

loading aiTn to bo the nmv tip; the former t^t tofM i|P||i#fi ,4lii:P|INi«^ 
end the loecjini ipi Jt inc<^^ ^^igill^i A^^ml^ Miii|>itff^of an erm cen 
elf ft ciH«it,%L ap ti^li^iiily 1^^ l|p|g^pt<%lfMM^ll^illl|ethe 

,c|n,f)||§Bi:|.;.teii arw jj|^«pf ^ mfmkAm^immimimik Hmmmm^mmt^^^ 
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ehang* m t^ ^ tlw tm\^ Wtmm oMy ii1datlM|inil^ tip ^ e«n Imiv« 
ru functtdn «li|t <iiti|i4 atiM*i tonporiry^lWarlf%ri1|^ to 

l>»rmaiMMn^ «iM l^^anetton «tit». A tndlDf •rui nwjr i^ f«-io«rf eolit In an 
afff, In ^^dir%i^M£<il9niif« or rojMlr IHo array, ifi a flaviPod or rtrogularly 
•liaped arrays ttia a^i^ ii>^ )a ttuka iHtiigv iiarfnii fiilM, ii^^ i»^dijiid 
flaws and f^-io^ iMliifi iioei^^ ivi»^lt. IdliHtiiit-iaiMwr^f^liiyi^^tho 
loadan and d^MiPM«H«#«Miar <^^ 

Uaa ofaliataNiead loaiH' iitfHiiA^I^ facinftataa 

niael««a««aiNiiii| in^* 1ta«w«d i¥ay. A iomiitmMWMfkim^ Hiidbig 
afgnalt dorl¥ot Iram tli» moAo-aeiiv«b Maiieid railuni ofHia toadar. Tho mono- 
ac«v» fjiepirty af Uia toadar IrapNot tiwt a woiitfrt ^ ean bo ae^lvatMl^ loidodi 
and da^aettvatad l^aiaadlni ihirtt«ad%^ i^^M0-«m til ^ vaTa 

aid»-8ai«. ?l)a iaMcr^ Halmm imoina iiaf Alif i#%i#i iMir biisuii iiy bo 
^aod to a<^i«N a^^^M ^ (tiding 1^ «i# m^M flUoNd Mn^fiUn aliito, 
aufotiMttMnt^ adnd liadlng tignatt ttvough IHo iiif iw Alilf aldo^adt't toador 
au^Ki^, and di-^«^ilo the caifi loadir. Tha ^adar^a bdhnpo aRoMra a loadk« 
arm to ^inn^ m aaaio fiidini iiiaMid i#ii itf^^ 
ori^ifia^ IHratilli in array. 

Thm am ^bataneo* is dyad to indNiiti a oNf TMumiifii'a ftawftlUnal 
symmo^ «^ roaiMet^ Mt aido-'a^ A^l% prde i i t il iJliMiiint may bo 
baianeod lor aomo «r aif of ^ eal^ function irtaM^^I^^ 
% 0(4^ |}roeoaaini MaehiNaNi ^lliaali^ lafi booili^Mi^ ilielWf by ia^ 
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of statements rsipt^ftK^subfo^MK^^^^^ • • Oln - 

some ^ocesslng outpul In fuwtkw si|t« ^J^ ^g^^g^M W1iPlll«i*iP*#^ 
apiicable procMsing s^tM. li ^j>9i^»^^j^^^ ^f^^^^v^^e^i^^m 
set of statements - such as the permutaton tnterchanging L and U but keeping R 
and D where they are - yields a set of stiAeiMntt ttwt eompMeiy deaer^et aome 
allowed function state, the processing mechanism Is tatatoitf for strte S^ and the 
balancfr€tat«d function states generated by tiie slde*eet panmjtetiens. If a cell la 
balanced in every allowed function-state, the utl is Mtnutd. SometlRiea the 
construction of a ceN requires disallowed function statee. One migM, for instancei 
use four function-specification state bits for tNrtoen ittowe^ function stelM. 
Three function states might be incidentalty genen^edi useless, end therefore 
disaiiowed 

An example darifles the concipt of a procetMing meehintoffl's balane*. 
Consider some processir^ mechaNsm witii one proMsskig kfHit and one preeeering 
output at each of its four side-sets. One transrolM^ function state F^ of tMa 
mecheNsm is desoiised by the set of stat«n«^s below. 

ForF^: Oo -» 0,* lu "► Od, It -♦ Or, Ir -♦ Ol) 
An arrow indicates an input Is transmitted to an output. This proeee^ng 
mechanism is balanced in state F^ if and only if tite ceN has function states Fg and 
Fc such that the fcHlowirq; statements are true. 

ForFg: Oo ■♦ Or, Ir ■♦ Oqi II •* Om kj -» 0^) 
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For Fc: (b -♦ Ou II -♦ Oo, lu -* Or, Ir -» Oy} 
If the mechanism is balanced in state F^, it is obviously balanced in states Fq and 
Fc- We then say that F^, Fg, and Fc are balance-related. If the mechanism is 
balanced for every allowed function state, the cell is balanced 



^>^j •.^^^.. ^ ^' ■^■-'^•^ ^^ -»-**^.«rf.||s«3Ti«f^t%-^7:^^ 



• ^tWJII|^» 'ttMf 
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faulty behavior. Oip.,1^ f|i|p^|Qi Jliy|{^ j^ Pf|ie|#^;^||^ 
macWna'a control, and not by tlpwlt (^i^|^ |||^j^|||,,|i^,|^ 
allows appropriate tifnalt into a fiivte tet of ceNt to af^et loadkif i^ a cell, Uiere 
It some chance that faulty crils ^tM provide those ^pMls to liMd e Mil without « 
test machine's control, and therefore contradict this ataumption. We describe 
design techniques for rmkitm tNs arMtrariiy unM»lyi thte in^ves maidng Uie aet 
of valid loading commands smaHer than the set of pesllM e loedbig com ma n ds , ao 
that fault-generated commands are iikeiy to be #tebeyed. Another basic 
assLHi^tion is that the behavior of a cM d^»ands on UmA cdt*s state and inputs, 
and not on the state of other cells. Ow checkerbMnl arrays help assure the 
vididity of this Msumption, became no sipial line MMieets distant celts. A third 
basic assuR^tion is that a fai4ty cell is somaw^MA Mnsirtent in its faulty behavior: 
if a ceil is good whenever it or its neighbors are tested^ the cell must be i^od in 
the intervals between tests. If a gMd ceH becomes flewedt it RMQf not pretend to 
t>e good whenever it is tested. The first and third assumptions are met if a fauity 
cell's outputs ti\ rwnain stuck at some value. AnoOier assumptierv yiAM\ is made 
to reduce test time, states indepenctonce of certain mechaniMis in a c^t. For 
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instancy it's Miiimid thM the ttaU of a «m^4^iiMl'i^(i^ (^it Mt i^i^the 



,-i,-.,v' . v'>.WBWL5i - ->' Ti~ .«,*.,^.f%3gia(i5J«»^«s«<|«|«:i^-SS^!^g(^J^*J^^ V --"I? 



fAoeap 

S«etion 1.4c PracMiinf-fayttr MaeHtowt 

arm m?e4<iMi, wNch ara ptch^ that ar* aNuM MbiiMjR ^ m^^mf^^^V^ 
as armt coropoMd of Manewi c#ta, ^«ta mmnJHmm^i^MkJ^m ^^•nd 
repair. Proper communication of a dif^tai macMiio wtth onai oid«-t#t 9f on* 
woridng twaa cati in a ftawa^ aititrir8y.tar|aarrw^allaim,M andr of th« 
array. A ioading arm padualiy t}(tandi an Mbfddafl wa macNna into an array. 
Aftar Mdi oxtonaiorv Ow arm it taatad via praceaa l wg alpa^ kiohvoon tho Array 
Programmer and tlw bate of the arm. if yi>wto it yn pjt ic cewfiii, the ftrm la 
retracted md then grown through a nmv patti in tho Jpiwd ainy. The Nionef of 
the ceilt in the arm machine impfy that aN arma iMtt Op «me fiumlbar of nod fieda 
may perform the tame functioi\ indapandant of ttMir Mlh ttrough. an .array, for 
contider tome e m iaedded arm machine lull whete T function atetet are detcrilMd 
by the litt (F„ Fz, . . . Ft,). F„ it the fundion ttate of the am^t betei wMch ta 
the firtt ceil in the am^ and F^y it the fcaietion ttato of the Nth mU in the aim 

Ft I it the function ttate of the arm't tip. B ecau ae aH ttie Kmetion atotea are 

- -;; ._t ^ "- - %A' i.% J- • -', h«?s *-*-.,..-. 4 .irt"-r,&-|| '-. ,-? bfid -)„'av :. '^.i* 

baiancedk an embedded machine M2 it equivaiant to Ml If and only if M2 ia an arm 

•dj ?i.^'i" 4-»*i-'rf r-s 0"'" " it* :,#*» n® •'- il\v> ":; si^'tn »iiT ./"•-¥ ^-hp^^ -n 

machine that hat T function tt^M (F,2 F,, . . . Ft^, and Fiy it balanee-rolotMl 
to Fns5 f or aH 1 s N s T. 

Chapter 3 ditcuttet prograi^t Mwt fovem ap|ed<nr of^erm niaehifie% 
given reatonabie modeit of flawed behavior. I^epelr of arm arrayt it ttudied 
through a program that timulatet that repair (tee ftere ai7 and a^8 for pi^trea 
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of a repairod m^. In rtost flwMwd arrayt with N perdmt flawed catlt, s N s 
25, thlft pfoirl^ tibedi ah ami iiiMrii bbnkil^ti'od - la N> parcmt of Um 
total edit \n ^ im^. liilt pirfonnanbo can l^"^gli^ hmpravML Whwi N ia 
greater than"atl0^ittfiaftil^ 1^ Wf ^"iiii''ari1£iilfiii dw te MObMldaid in 
a chackarboardT arri^. 

Chayffir d alto dTlcuMCi o^ latuat riavant t6 praeiicai rwllzation of 
arm machinaa. trt pirlllctilar, ft proMintt a falH^ dlMid fitilf for raaNntlon of 
enormous co#(ilar>r«^fabla, variabta-lanfth tMft-rniitifrf in a aingio IG 
package. Arm ^ttiine realiiationt are appropriate to many micNnet whidi are 
realized as a ^^ flFf mbdiles, wi^' ^ach modile owMMfiiMi^ at moat two 
other moauies, ikf 6rily ^ Moclkiis at tha in^e^ i» ieliiNi diridtiy connected to 
the nMkchine*s infMitt and eutisut*. Hiwny fl nidfeiMW i ittimi caNyMr amy* have tMa 
characteristic, so thi^ couM be appropriataty realtted as arm nMfihinea in a flawed 
checkerboard array. 

An tssenttai maehtru is a perfect machine em bed ded in • proceaaing layer 
that is descrlbttd is a machine composed of essentfal c«N^ that i^ wired together 
in some way. The tsstntuu etUs of an embedded maeNne are thdae cells that are 
not in non-branching trahsMssion stales. The four ceHs in the upper-left square 
of figure 1.3 are the essantial celis of that imbedded mecMne. A «>fr« cerries 
relevant infofmition befw«en ah essantial eatf and its issmttai nngkbor, which it 
another essential cell or extra-array machine. A %vire is a dhrected signal pith 
from an output; this pa0i is either diriict <t>etween {htercohnee^ neigiibora) or 



.'■>»_ ». -,',->"'y-J^ 4jtM » t>ltt J«i'-!Kr-T «■»!?<.•»- Hl^(«j|M»«(ifN-.-'~;rffl<»-^.^ f&r,^--- 
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indiract (via edit in tran«ninion ttatM). Th« outpul of thf Jfpift.^ t|«^j|pi« 
cell of »|ur9 1.3 it wMj^r^JiJ^jpn^^ kidlrMtly to 

M ois«)ti4 fWKNff it^ippo^i^l^ 
rn^r^s. Tho arm Ritii^iiot dffciilfi^ ^ftfim'^mmsmmiimikmkv of 
•«foriKolj?»U«. |,^jlili^ t^jf^fsl^i^mim0b^m)h^ It 

!" fW^*''=^ «tiito F^ Ijt^s i,iimmmM^iHm0mU*9^mi^^ 
If it it^w ja i^ii^ J Gf|%iii^iijpirp|^ |i^^ Wi«. 

^w'lw;^ IftKT^.^ f^off #,^«cH^ MlNvorfc 

for ||i| jti^Jtlfl, m>^tH|.if» ligyft ^ hfmm^^K^mkmfl^ j>liHd l y fflf on 
•wi^tMl joM^ j|^ i^|t» joithi ^iquift:^!^^ jl|Kfii|% jrfi^ pw i wi i nl aiwll In 
the obvioMi w^. Owaon^ §^pfl(4 fy mlm-wm mwi^Mm M ^^^^mt^Vm^rm 
connects an OMentia^ ceM> t^jtol ||^||i|^||i| mimm^i:9lfi^9^ •« 
•'•*!^^ 1^^ #W^wr«i in jtfc^ ^•ei^i|#ft\|pf^,>|i. INbtiMP4P%ai np«]& An 

— ^fl''^ '^^^"^^ ^m^^m^i^^SmmMif-SjBi^^^mm i» an 

associated reicon network. If two cells are essentW nal|tibfiii jfiuPt ^Miptlal 
n»t\AfO{:H^.«i|Kh relcoA r|i^5|K inuit^iiM Mi^iJIii^tmkw mwmmA n^ etaMtlil 



-s^kw^Wf^^-i^W'-^^i" -f,- — ^^l?3^3&e5^:^*^ ^1" V s Vf- ■ 
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«ft« or ntor* in#t»). 

htgh-relton nuuktfus. AH aiiib«ddiii|8 of a M^i-r«teon iiiMtt diiln iciiy edit 
wttli tl«r«« «r le# m^Mi MliNi^i#i-^ir1)iM>^fi^ iiipif^ #roaKlur# only 
r«cogwte#i iif ii iiia<il %iiKd^ ai itidvllirif If tiM# fili il dWii •ilMit laNai 
ami eM^ if«^#tlhalifli^#iwK^i^^ of 

machine, th«Ni li ««i Mf iMy eni'iaitiiBpMiNrii ^#yiidMiy^fiMt« that 
otr h^i^ri^MtW itlMirfNM^^ for ovMy 

«4r« ill m&-n0^irmmi--mi«imlM^imfd^''9^ it dfti ifd ^M l^ia Ih an 
•quNNHMrt flwohttia. $mi^ ^tMMiliaf M^MI^ af i^rlii^fitei^iM^th^ aro 
y»tMi(iAe«Eli ^^r^^afiitf^%iiarMil%M#l«tif N^iff^ iHw^fl^iMi llato. 
CorrMisftfiidNi iataNltal ^Mtc wi fiirtii ^ (Mf ist^li^Mb^tM M ii^tra- 
arrariiii»MAiM4fft1lii Unii viwy. Hiif tha MMi»i af aiadtiatid wiraa may diffor in 
e<MvakMl «M9ifilii^ mMrm ^huf^ ^tibdMlii ^ M|| i#Nei^ii«lBl thia 
faeir 4)^ ri0«ir^^pnaiiiaiHl %)or«M fHa'iiiiNMiii^ iJF^t|^>liit WilM^ iMIt 
df d^faranl a»aii^ iMs f#^A0ii{flr iiii 1^ It raiMiMafiii, aft^ a 
(toiignor fypii»Ky t|i«<^at #i# lifvJiMitriiilit^ tfitlt ^\i porform a 

ctori^Mi funetfoa 

Qm rai»air af hl#t-ra)eo(t riMeMrNHr ri#iNil' Hif i^iiriirtH^ that tho 
iongthi^«iaee(ated\Mira«iN^dmaf«i^«qtil^^ Wtii w* 

mwrtian^«hni<|0if ivIt^haH^astura Wt n^lifiiri W ^H^WUkmm^ th« 
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most appropriate high-rateon macNna architacturat. Tha aatumption th«t th« 
length of atsociatad wires may differ in equivirient wnb9<Med machlnea ia not 

■ r ■ J 

required for repair of arrays emlMddin| the other machines we detail. 

Uke chapter 3, chapter 4 focuaet on teatinft confl|uratio^ and r^fftir. 
We show that the mechanisms that provide these facilities iH^e very close to 
coniparable mechanisms presented for chapter 3*s frrfys. The loedera ere 
functionally identical. Testing is accompiishejd by growth of transmission Unka 
between a test machine and a tested cell. A tfst ttnk is a transmission lif^ 
between a test macNne and some celi in an arriy. The baea of the link coonecta 
to the test macMne, and tha tip of the link is the cell on the opposite end of the 
test link. Each ceil in the test link conducts signals to and from the tip end of the 
link, where a tested cell may be located. The test Hnk is |rown es the bociy of a 
ttst armt which is a test link terminated on a e<^l in a '^-tifn" function atate. 
Signals from a test machine into the base of tMs test arm flow down the erm*8 
body to its tip, turn, return to the base of the arnn and ond at the teat macNne. 
The test machine uses such signals to monitor the growth of e test Mm The 
kM^anced states of cells in the arm allow it to flexibly sneke around flawed ceUa m 
it grows from a test macNne to the side-set of a tested cell. Test links ere f^own 
to all the accessible side-sets of a tested ceil. These links allow a test macNne to 
monitor the tested cell's behavior in various function stirtes, which are set by the 
loader. 

Repair of amys embedding Ngh-relcon machinea la aqqemptfahid by uae 
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of tranfmittf^ ttattt to SNif togothor ottofltial noifhbort which aren't 
intorcormactioA naighbora. Exparimanta with a rapair aiffwiator aiiow ua to t>egin 
to compare tha rapair coata invoivad wilh dffarant aaaantiat maehinaa. The moat 
difficult chacKHboard-array rapair iHvaivaa aMMddbig a high-ralceii machina 
whose easentiai Aetworit is a grid (see figure 1.7.A). We call such a mechine a 
grtd wuKhtnet and ciil its essential ceits grid uUs. the meat general repair of a 
checkerboard array assumes that a pid wW be ei t ibe d dad in the erray. Many Ngh- 
retcon ma^Hinea have esseMlal networks with squares and links missing (see figure 
1 JJO. Weil siil that these machines w% wkmiikA in a flawed array more eeaily 
than grids maCfUnes. This shows that itnowladge of the eonab^nta on relevant 
inter-ceii communication paths during an interval fadtttrtea repairt but may require 
re-repair when the interval encte. 

Chapter 4 also details an array of simple ceils designed for the 
realization of arbitrary dgitai maehinM. Otiwrs have deaeribed Infinite arraya that 
may contain inittaity-flnite machines eapabie of performing eny eemputation and of 
constructing other macNnes that can perform any computatien. Our array la the 
first one we*ve seen capable of embedding a univeraai computer-^onatructor- 
repairer. A computer may be embedded in a finite portion of the proceaainc layer 
of the array. A function state that transmits processir« inputs as leading outputa 
provides tNs computer with a loading arm (TNs is the only time thet our prevloua 
description of programmabia logic is Hightty incorrect, in that proceaaing inputa 
may affect toadlng outputs.) Under tMs embedded computer's control, the loeding 
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Fig. 1.7 Essential Networks For Two High-reicon Machines 



A) Grid 




B) IMon-grid 



^ 



Fig. 1.8 Relcon Networks For Two Equivalent Tree Machines 



A) A tree with several branches 



^ 



B) A tree that is also an arm 



^ 



PAGE 37 

arm works with four tost arms to tost, progrwn, and repair tiio comput«r*a 
environmont. Tho maeMno m^r cwittnict nw imrmm^ fwr ittalf by iMing Its 
loading and test arms. Furthermore, twojrrfllQi^Bonputers emisedded in an array 
may tmt and rofwir each othw. We b<M^ 1^^^^ ^ embedded medibie iaw*v# 
designed as the procemer of a unlv«^ Mmpu^-cwirtnietor-r^aeirer. 

Chapter 4 also discusses practical predueUen IsdiMi i^ ^^leation 
weas reievwit to hi^v-rsiran modifies. 

Cha)»ter 5 cRscusses processif^-liQfer machines called trtt maekitus. 
Random-access and traek-ackh^etsed se<)uentlai-acees« memories may be 
efficiently realized as tree machines in, ^aiiyed irri^ This is true because tree 
machine realizatiena are appropriate to machines which m«y be viewed at a small 
set of modules with a common input bus and common ou^tA bus, with the oi^put 
bus accessed by orriy one active mo<^e irt a g^ven time. Each cell in a tree 
machine is a balanced, essenUal Mil whose funetien state Includes a unique name. 
Ait embedded tree machines have tree-like mb0tm^mii'^ liilbli' titworks In 
which a tree b-unk, which may or may notrhavti^fshoot branches, extends from 
the tree's base ceil <see figure 1.8). A tree's kast ctU is the only cell thet is 
directly connected to the input-output lines of the tree mecNne. Two embedded 
tree machines are ecMvaient If and only if they have the same set of ceti namesi 
the particular shapes ^ their tree-like reteon networks are irrelevant Thus an 
embedded tree macNne whose reicon network is an arm may k>e equivelent to en 
embedded tree madtine whose relcen network has several branehes. Tree 
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machines are embedded in flawed arrays more easily than arm or high-relcon 
machines. If there is any path of good cells between two good cells in a tree 
array, those good cells may be incorporated in the same tree machine. Interwoven 
test and repair processes for tree machines are like those for arm machines. 
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Secti<9n I A Arfi^ f^^Mir 

arrays. iMnf ^e^prt^mi^li»A pmfiS^ei^ 

rnethttfe for iimfmmi'f^mmi0^Ssi^m^ IMff <^tliii»iififtil^ i^ 
custom maialliz^lon^ but s«m um propwwiwd rttptir. Soam Nava concai^atad 
on nac«8«ary and sufficiant eoncNtiont for taatabiUty or cMagnoaability of a 
particular typa of array. Wa datign caN medulat vvNch ara ineorperatad inte an 
array to anabi* taatingi loadings and rapair. Wa aiaa pr a t a nt Vm firat ^atamatie 
traatmant wa'va taan of tha affact an ambaddid maeMiia'a cofMnunicatlon 
structure has on tha tastobytfy and rapairab^ of an amy far ambadttog thai 
machine. 

Wa dascHba how constraints on ttia wirbig batwaan aaaanttal caHs of a 
machine affact tasting and repair of an Bntijf aw b a d ing ^lat maehkia. Chaplara 3» 
4^ and 5 comider this qua^on b^ focusii« on three r^i^ad classes of nwchine - 
the arm, the grid, and the tree. Figure 1.9 indicates hew these three eleases 
relate. Given a ftmMid arr^ is to embed a given ^pa of niaeNfia» wa medal tha 
repair process in the following way. The flawed array is viewed as a /lav p«tttm 
(see figure 1.1 0.A), with a dot correspondbig to a good caK and an X correspondhg 
to a flawed ceil. The machine to be embedded in the processing layer is 
associated with an esser^al machine and a class af aqulvatent e mb e dded maeMnea. 
In considering repair of an amy, this class is rasb-teted to embedded macMftea 
whose dimensions anew them to fit into the ftawed array. The nature of tMa 
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machiM. A par|ia4lar ambwicM iMchbw i| c|i«|Mffv.|i^^ 4Kml»!plim»«l«M^ to 
that tlM inKWn«!i^4#cof^in#WH(«jm»^^ iMtNotitny X 

The array it rapairfd to ambad that mashbia. For in^tam comidor cnriMdcIng a 
desired arm machina with 13 edit; aH mmtM&d ambaddad machinea are arms 
containing 13 ceila. Only tome of thafalii|iik|d macNnaa have a relcon network 
that fits into the flaw pattern of figure 1.1 Q.A. Fieire 1,10.8 riiowa one such 
relcon network superposed over the fl^n^iMittem. The asaodated embedded 
machine may be embedded in the flawiiij|r|y;.^ 

We noted that the nature of an eauivetenee dasa doMnda on the 
function states assodated with a given amy^^or kiolaneab b da nc e d ati4ea may 
expand the size of an equivalanee dast i|||^|f^Qre tadlttito repair. Arma ml 
trees use bdanced cetis to fadiitate te^Jifi repair. The balance of ceils in 
transmisdon links facilitates r9p9kr of a^ifii embeddb« Mgh-relcon machinea. 
Figure 1.1 0.C shows a 3 x 2 grid roadf% jmibedded in a flawed array. The 
relcon-2 nodes in the network j^ntipOMili tmiiidnlin ■Harot.^aiwictwd to grid 
ceils, in pid-ambedding, cdts used as ittieplvarhMd MMdiled with rtpdr. 

For every flaw pattern ancT^^^f essei^d machine, there'a an 
associated opttmum uptttr tfftcuncft whic|i^is4lle h^Ntl attdnabie ratio of the 
number of embedded essentid nodes to Ifiii^i^ of dots in the flaw pattera In 
figure 1.10 the optimum repdr effk:{en^'i#^/14 ter grids, 13/14 for arma, and 
14/14 for trees. Ut OftEe, OREt. and ORE^ be the ofrtiaium repdr effidendea for 



PAGE 41 



Fig. 1.9 Rel<BAiimd^w«MGr{dS|TrMS, AndAriM 

If a grid's essential network has a certain number S of squares, ail the grid*8 
associatwf ¥ift#«A ifetvllffts have aft teatt $ ndilei. Fer ilMSh of i grfd relcon 
network's N node^ Uwre we one or more b'ee Mbnetweriu with N nodee. N ie 
greelir^'MiM mml^m%'^m iri«b#i ^flie^lNMi-'MyHMHtf''^'« grilff teleon 
network we arms; Mch has n or fewer nodes. At k«st on* arm has S nodes. 

A) The sNinf^eit retoon network of a grkt ' H» essentid network 




B) Another relcon network iw the stme grki 



^m;^i»<^ 



C) Tree sulsne^orks of A's reicon network 







D) Tree s ubwetwerte of a's relcon M^wwk 



■ ' ■ ■ ■ 

nil 
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Fig. 1.10 R«ptir Of Arrtyt With Tht Sam* Raw Pattern 
A) Flaw pattern 



• ■ • ) 
■-X' 



X' 



B) 13*nocle arm In^ flawed array 



^X 
The repair efficiency is 13/14. 



C) 6-8quare grid in flawed array 



♦ 


t * 






X 




4 


W 





The ^M» p^terQiita».l4:^d»lK: fM iatelMiiiieiP nei0Mfli:4p» 14 
used nodes, but 6 of these are esaenMI AedM and 8 of these are 
relcon- 2 overhead nodes Meeclalecl mrttti trenemleeion Unka. The 
r^Mur effidmcy is therefwe 6/14. 



0) 14-node tree in fliM^ed array 



■ ■ T 

Uxl 
The repair efficient is 14/14. 
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grids, trees, an^ wrna for a givm flaw pattarn. Because of tN> relalion between 
grid, tree, and arm machines noted in f^e 1.9^ OREq s ORE* s OREt ^ "V ^"^ 
pattera CiMpters 3 and 4 explore repair ^fidency a^tahed by programs tftat 
simdate r^pik for arnM, «id for |^ and oXtm highH^elMn mai^inM. OmfAmr 4 
compares the results of these experifatii^i J^t^iMp^^jNid t^^ 
exploration of testing and repair argue ier Mlns oriented, wrfien peasHBle, toward 
limited recMremer^ en the comnunicdiln'iiatha between a rneeMne** essential 
cells. 

Chapter 6 summarizes the thesis, and suggests further produetten* 
oriented and theoretiMl iM'oJects. 

The next chapter provides context by exploring other systems 
approachesi t^iapmng Vtmia to tttis on% anfl ipn^ierN^ evoh^onary trends whk^ 
suggest thet this appreMhwili b e co me inoi ' i a idi i l y sM Hirt i ve L 
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CHAPTER 2: CONTEXT 

Section ZO: Introduction 

This cha|>tor pi4« ttlM worii J^: Mfl^^ ;«f^ t(»f«i«vant qrot«m 

approachM and avoljiiliofiary Iram^ %9^ l)ifP9«|||t.«t$iUu(aif/ arri^ft «r« 
discussad, and the r«|at|on ^ our appfic^ |o llia«« |ipfamf(iN>i if delaUad. 
CatUMlar arrays and convjptioi^ JC ty^^nii^airi^^^^pfp^^ of caiiular 

arrays on a silicon slica is sho^vn to bt sltiiiMr 1% jb;^^ fabricate of 10 
circMlt "chips" on a slicf. Four ayoi^ofip^, tff|||i ana c Ha f j ifS f d t rapicMy 
Incraaslrif capaMlity of intafratad cifcg^^Bi|s§|^f^pc# ^;,#^ 
machines, mass-production of a few Ngh-volufpi^^pp<feiMyH •■'^ij^ci!^ 
regiiarity. Our i^^proach is viewed m a ^rstams approach taitored to the realities 
and trends in digital system design, mafii^|MI% iffifiliii^ maintenance. 
Oth«r efforts towani yary high inte|ri|i(yvji*l^|^iil4rf^ «nd«iNar machinea 
are reviewfdi and th^ are c^Bpa^^our ippcM#v V 
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Section 2.1: CaiMw Arriys 

2. LA introduction 

This Metion Joeitof our c«tHilir i^itteh in 1^ itoftiain of cdlulw 
arrays. Wo focus on di^nelions hir array ilHorcihfiii^^ cusioiilislion^ ai^ and 
function. ¥^M»flf cansidar «>e oirrailt steto «f pt^fiiiHd array ^aliina. 

Th9 Mavtor ^ a €^MUr irriy dipindt W ttif ^tNirti caiatilMty of 
its cells, and lUi^ IfitaKonriactton. Siica aMi^^ini^ is eurrif^ inoit isutted to 
impl«(ytentati#n of a ^fs funettOTt, wa dascriba balls uslnt terreiponding 
tartninoiogy. Howwcr, tha approach appHm U funetiaifiih^ tiqijiyalint irraya 
raaiizad )n other ^ehntilo|ias. 

2.1.B Arrior Intarconnaction 

Arrays wi^ many different typM of intarconnaction hav4i been studied, 
but 1- and 2^dim«isional arrays are moKtMniori Vr a ehiy»Miti array, a cell 
may send signals to and from at most four neighbors. The oApoInt array, and 
other arrays with the same type of si^ flow, have been extensively studied 
These cutpotnt-conntcud arrays have the same interconMction rwtwork as a 
checicerboard array, but signals may only enter a cell from Us left and upper 
neighbors, and leave Uie ceil to enter its lower and right neighbors. We chose a 
richer interconnection structure, with its slightly higher associated coat, for several 
reasons. 
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1) Most machinftt racMr* f«wf^ C|«l|t «||^.ii^ «m«^mI cMayt in 
checkerboard arrays. Cutpoif^-con^ted irf|p arf^Ui)nf|ad by the fact 
that an operation on the oiitpuls jpl i^ c|f))t can^j»jt i»i p^rfoffied 
above the lowest of thfse c«iiSi ^ If ft pf J)ip i1g|tiiMt,f|f tin^ callt, 
without external connections for Jjls purpfff. pteffqw^^pard «r««ya 
don't have this liraitaJUfn, bf«i«Jl|ll tiifijC^II ei^Mtf In ali four 
direcUons. TNs irjeans, for inslanc^|^||i|i|||||^ of an 

embedded sequential mfcMnfiMy 
array. 

2) Signals from an arbitrary ce(|. in $ pfr^^.w^trfi^yJarge wray 
can cause loacting of an arbitrary cpll in th9 ffCfy ^nly if ther« it an 
interconnection path from the ioadjnf cell to tht iP0fd ci^. TMt 
important capability is thereforf ipip«|9i|4i fn isptpAiM . 
arrays. 

3) flapair Is more n^f}^M,(^^jf»^m^^ ^J0m^ 
set of possijt!4e procts^ip trwismiiii^ 1^^^ 

The ch^erboard arr^> inter^fniii^ojn •bpe^«,^J^^ coinpitMile 
with the two-dlmeniionai, step-and-repeat n^ui^ of I^P|fdMptifi«L Fiflhirfnore, 
tNs structure is relftivi^y eify tf ukN'^q^ jHid jp^^ 
instance, hsKagonal two-iimfn«i(NialftructurM- 
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2 1 ,C Custoroizatitm Techntcpes 

Am^har aspoct df cdTular arrays Is tMr cuatoralzation technique. AN 
but tha simptMt arrays hava the proparty that aach cat! can ba cuatomizad via 
memory alawaiits to ana of a sat of function states corraspondiiig to varioua 
output functions. Thus an array can be custoinlzatf to reaHza a particular 
ambaddad macHifne b^ one of savarar cuit^Uon tedM^ 

Uiwltarafaf r cusbniizall^ri lata in IC pi%dij^on1s a common, extreme 
form of array cuitomMon. A ooiwiion teM(|tja ui«i iMMll^ 
masic, fusible re^ai links, laser, or mechanteal scribe. PelyceU, llieromatrix, Read- 
only k^mory im*% md Pi'ograromabia Logic Arrays (PLA) provide weH-lcnowm 
axamplas of this ipproa^ Because such custonitttian ia unaitarabto^ daaifn or 
customizi^M arrof s can ba part^arly disastrous. 

Programmabia ROMs (PROMs) acMave r^tar flexibility by allowing 
customization that is alterable, aibdt currently difficult One such technique uses 
FAMOS transistor which can ba put in I of 2 conduction states by appropriate 
electric signals (see <F«itnay 72>.). Intel guarantees each b'ansistor to hold its 
state for 10 years. High-energy ultraviolet light or x-rays can erase these 
memory elements for subsoquant (^Frograiyii%^ ^^^1^^ include the Ngh 
voltages, long write-timas, and difficult erasing aaaociated with the FAMOS 
tran8{st(»^ ^bppHy, S^arHay k^oor of tiitaf iq»ilts tftt^^ii-pre#amnwb(a. logic- 
eraseabia FAMOS transistors will ba daveloiiid sOott 1tiia w^ 
great advantage of a logic-compatible, read-mostly, nonvolatile semiconductor 
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mMiofy. 

Progrwnmi^t Logic (not to bo confutod with Frogr onwwo b l o logic 
Arrays) providos the ulUrilito%)tuMiii^aiiriN[)iii^«it^i^^ 
volatility. The arrays protontod in this thoaio f|ra f |^ of |!ir^|^NiiiM|}lo logic. 
Ro-cuttoroization of progrMMnaUo to^ it at oa^ •• loadhig Its funetlon- 
spMification atato bits. Wo Joyolop p j^^ toat and ropair 

of arrays of prbgrammabla logic Buitdin|^tMt 4*d rmir mochaniams into m 
progranimablo a^ay am provMo ^^^ coita ttian thoo* 

assodat6dwith|aM|fi»(il}lyci|MioniizadMa|Bitftde|'«i^ i 

Boca^ta a practicit tfiffTtfil^ logic would 

probably bo reoHzad via samieonducter toehnetogy, and ifocauoo aomicofKiuetor 

- i * 

momories are currently volatile, programmablai lo^c U civrently volatllo. 

\ . \ 
Development of logii-ccmipa^e iiMv such at a 

modified form of FAMOS gate, would offer bigLi|^^^^a|||^^|y|pf|p9Pf^ 

A further difficulty with programmable loic to Ks Inoreeaed ceN-CWaya 
compared to metal-customized arrays. TMs is trie bee4iiM there are deleya 
associated with the selectiorLi>^1t#Kiic»^4^^ («M figure ^l). While 

there is no den^ jthis dTfficutty, tvra fit^ ameMitoM aituetion. The first ie 
that the delays ithroi^h gater A «nCB Jaj^^ii^ 



I 
be made very sflieli 



4.^,.-. . 



because these elements can be desipied assum^ Fi, alfunctiofi-specifiaitlen 
state bit, will nop change state during normal opai^l^ of ^ omb o dd e d macMne. 
TNs means that icatt^^dalar can briboiit 9M p»=^^ lo that logic 
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Fig . 2.1 Two CiastisraisiitiQQt T«<cilmiqpBNfi 



A) Metall'*cust(»iizc»d 




B) pi^ogrwfflsable logic 
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gatet ar« becoming incrMsini^ fatt, aapaciaHy whan daaignad for a known^ aiinpl% 
low-load anvironmant If Josaphton juncUon gatat baeoma practical, Uia axp«etad 
delay through a gate of about .1 nsac Is the same delay as that through 3 cm. «f 
wire. 

A final probiam with programmabia logic it its demand for extra g^ea 
for loading function-apacification state bit* and selecting a particular output 
functioa These gates consume an intepated circuit*s area and power. The fact 
that extra area is required for these gates Is offset somewhat by the fact that 
programmable lo^ minimizes non-circuit proramming tadlitiesi such as the np^y 
area-conauming bond pads required by soma custom metaiNiation techNquaa. TMa 
becomes more significant as shrinking transistor geometries make bond peda and 
other mechanical customization components occupy a relatively Ngher gete area 
The power consumption probiam is alleviated by the fact that function- 
specification state bits change state infrequently) in tome te^hnologiea, an 
element's power dissipation is vary low whan the elemaiH It not chfw«ing atat% 

Because of the pin constraints on ICs, most prqBoaais for loacfing 
programmabia logic attempt loading via electric signals through leeds at the edge 
of an array. Chapter 3 reviews the most attractive methods that have been 
suggested for programmable logic loaders, and gives a loading approach with 
advantages acWaved by adding a small amount of circuitry to eech cell in the arriy. 



Size is another dittinguisNng attribute of ceikilar arrays. A particulw 
arra/8 size depends on the size of each ceU and the number of ceils. The beet 
measure of an iC ceirs size is the amount of area it occupies, but tNs meesure It 
too dependent on tedtndogy, designer, desifn time, and design aids to be useful in 
preliminary estimetlon of the size of ceUs. Consequently, the normal meeeure of 
size is gate-count of the ceil. TNs measure has limited vahie beceuae of the 
variable types, number of inputs, and density of gates, and because of the tradeoff 
between Input-output lines and iptes for a ceH performing a perticuler function. 
Nevertheless, several authors have used gate count as a means for roughly 
classifying arrays (see <mnpkk 87> and <Mukhopadhy«y 71>). They distinguish 
between microeeilular arrays, in which each ceil contains only a few gates, end 
macrocellular arrays, in which each cell coiMns a large number of gates. 

The cells presented In tNs thesis use few logic elements end few 
function states. The loading mechanisffl, the only mechenism common to all the cette 
we discuss, is shown in figure as. It has a minimum of about twenty gates end five 
memory bits, with slightly more If ioadtog options are InMrporated. A proeeeeini 
mechanism of any size and complexity may be combined with the loadw. The 
ectual complexity chosen for a cell depends on ttw envisloAed appHcetion^ end en e 
tradeoff between yield and overhead circuitry. In the memory erri^s we've 
designed, this tradeoff is the main consideraUon in determinbig how iar^ e memory 
td put in mA^em The universal mpmmm^ ^mmAmmim Vm 
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hundred gates and memory bits, and only fourteen function state-. This simplicity 
increases cell yield and reduces test time. 

<Mukhopadhyay 71>, <Kaut2 71>, and <Minnlck 67> have compared the 
number of cells of different types required to perform various functions. We make 
no such comparison here, for many of the functions we perform cannot be 
performed in other proposed arrays. All our techniques are applicable to 
arbitrarily large arrays. 

2.1.E Function 

Various functional categorizations of arrays have been made. These 
include consideration of the functional capabilities and the time behavior of ceiluler 
arrays. 

The most common functional classification views an array according to 
its ability to do combinational logic, memory, or more general sequential machine 
functions. <Shoup 70> discusses this in terms of "generality" of the array. Our 
testing and repair techniques work for any ceil generality. We discuss some 
memory cells in chapters 3 and 5, and a sequential machine cell in chapter 4. 

The chapter 4 array is able to realize an arbitrary digital machine. In 
particular, the array can support a finite-configuration universal computer- 
constructor-repairer. That is, a finite number of cells can be programmed out of 
their initial quiescent states into an embedded machine able to perform any 
computation, to create a new, disjoint embedded machine able to perform any 
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computation, and to do these tNngs in a faulty array. The embedded macNne*8 use 
of a loading arm and test arms allows it to test and proram its environment It 
can, for instance, enlarge its memory by proper loading of cells. <Rowan 73> 
describes a ceiiuiar array, of more compHcated celts, that Is computotion-univereal, 
but not ci^sable of c«istruction or repair. 

In a synchronous cellular array, all cell states are re-calculated 
simultaneously. Several synchronous arrays capable of supporting universal 
computer-constructors have been presented. Von Neumann's 1952 pioneering 
work. Theory of Sttf-reproductng Automata, presented such a 29-staie automi^on (see 
<Von Neumann 66>). <Codd 68>, <Gardner 70>, and <Banks 71> followed with 
simpler cells. WNIe theoretically interesting, synchronous arrays are peripheral to 
this thesis because they are currently impractical. Since state changes must be 
synchronized, many technologies require long clocic lines linking ail cells to a 
common clock. Signal transmission is severely limited by the clock frequency, since 
a signal takes at least one clock interval to propagate from a cell to Its neig^tbor. 
Thus the tranwnission delay through a ceil in a synchronous array is at most the 
reciprocal of the toggle frequency of its memory elements, which is much slower 
than the transmission delay of one gate-delay associated with our asynchronous 
arrays. The overhead circuitry for all proposed synchronous arrays Is high. 
Testing, loading, and repair appear to be more difficult for these arrays. 

Asynchronous cellular arrays are far too numerous for extensive 
consideration hera <Minnick 67> provides an excellent early review. In a more 
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recent prftsentaUon of a theory of logic detiin vvith ceHuier m^fh <Mti|(hapac9pyey 
71> summarizes and analyzes some of the major ctrilular arrays. <Kautz 71> 
discusses various arrays for arbitrary logic, including aecHJentialmediines end 
sp^ial-purpose arrays mai^ of the desii^ are his Wfvn. 

2. 1.F Current State 

Cellular arrays are already widely ueed. Pppidwr \C arreye MMid no 
customization (Random- Access Memories), SMft-Regiaters), or only a simple 
customization step (Read-Ordy Memoriei^ FVoprjammabte Loflc Arrayi^ (aee <tAiecke 
73>). Tlwe are aiw a few systems using mar^ ^s, such as the Hiiec iV. 

However, many proposed arrays remain p^per-studies for verioue 
reesons, including current impracticaiity and iC industry inertie. The charecterietice 
of some of these arrays have been discussed in Ms aectiofv as badiground for our 
approach. This approach overcomes the difficulties of many proposed celiuter 
arrays. Its ioadinf, test, and repair circuits, and their astpcialed program^ are 
compatible with many airayi that have been propoted. 
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Section 2.2: Arrify Fainication 

Fabrkation of LSI checkerboard arrays is similar to fabrication of 
conventional integrated circuits. In the conventional approach, hundreds of 
identical iCs are batch processed by selective doping and metaltlzation of a wafer 
that is usually 2" to 3" in diameter. Typically, a icey element in tMs complicated 
process is use of masks to selectively expose photosensitive material on the 
wafer to light Each mask is formed by photoraphic reduction of a pattera That 
pattern is formed by use of a step-and-repeat process wNch iterates e ioesic sub- 
pattern throughout an array. £$fA\ sub-pattern eorrespohda to one of the Iterated 
iC*8 masks. 

Each of a wafer's identical circuits contains bonding pads, which are 
used for probe-testing and possible connection to the iC package. After a wefer 
has been batch-processed, each of the identical IC "chips" Is tested vie electric 
communication with a test macNne connected through probes to the chip. Those 
chips that are defective are inked The wafer is diced along horizontal and verticel 
scribe lines into component chips. Those chips that have been inked ere discorded. 
The other chips are packaged and retested. Those thet pess these final testa ere 
ready for use. 

For a checkerboard cellular array, a basic circuit is similarly step-end- 
repeated to form an array of identical circuits. However, the patterns of edge- 
sharing neighbors overlap slightly to allow lines to interconnect neighbors. 
Because most of the identical circuits, ceils, communicete only with their edge 
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neighbors, most do not netd bond padt. Only lifjM thil i^Mqr.b* ut«d lor mcUl^r 
array communication naod bond pads. Scriba linat t^fbli^ Pfil* ^* uofiapifiary, 
for tha array naad not ba dicad. Scribfi Unat ««^.pae««WNy bi^wf^i) of 

a wafer that are intended to be parts of diffwwit arram. 

Most conventional iC padufM are eventm^ eonMctoi to other, fin^itar 
padcages. Thus a given chip's life-cyde usually inekidie tiatd^prpcessing ^th 
identical cNps, separation from them, and eventinl re-oonneption to other diipa. 
We*ll see that there are many advantages to a syftMif approach thet doesn't 
require separate handling of each cNp. This se^iratl^ ia now required for tvvo 
principle reasons. First, conventional ICs caiv»t ftjndieD, properly if any of their 

components are faulty; tNs necessitates making a cNp sfflell enough ao thet 

'-. .. -:-, •.-; -•■-- ■-' ' ■'- 

there's a reasonable diance it will be perfect Second^ a system of small IC chipa 
requires a variety of cNps; a slice contains only one Wm of chip. . 

TNs thesis* celluler approach aitaiibirtes ttie need for separation of cNpa 
in many cases. A slice is designed so that electric coiNmuniertiof|^with e digitel 
machine allows testing and repair of the sUcei fauRy rfj^ons of the slice can be 
tolerated. Chapter 4 discusses such an arr^ of idonticpl, sim)ie ceMs thet are ao 
flexible that a large enough array can perform any ocj^ajputatior^ ^nd can test end 
r^Mir its cellular emnronnMmL 

This thesis focuses on checkerboard arrays desinied to tojeroite 
defective ceils. Cells are programmable logic ceiiSi in each working cell, 
programmable function-specification state bits specify the fgnetion that the cell is 
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to perform. Qieft ah array has baen twitrnd^ alMtric comnHM^cation of a t«at 
machine with ir Wttan hun^r of cetit ai^j^where in ihe array tettt the entire, 
M-bttranly ivgtf array. The testing machine uses ^ same communicetion links to 
program the array to embed a perfect ms^ine^ by appropriate setting of function- 
specification state bits. The same md/cir other co m m unic ation linlts then provide 
the inputs and oiAputs of an array-embedded maeh^ realiied as celis in proper 
function states, fte-customizing Uw array is as sNnple es re-teedb^ its function- 
specification state bits. Sheuid an array machifis become defective because of a 
malfunction of its circuitry, it need not be cHscardeef. its tinlcs can be used for 
testing and repairing the array. Because this repair can be done eiectriciiiy by • 
digitai macNne, repair can be automatic, st and ardtoad, and quick. Repair can even 
occur ttvoui^ comratMieatim between an array and a remote test machine, indeed^ 
the universal array of chapter 4 can accept embedded mediinea that teat end 
repirir each oUier. 

The array can be viewed as a bin of spare parts, celta. For a large bin^ 
there Is a high probability that a certain percentage of parts wW be good Thus an 
array is fabricated to have more than enough parts for a partici^ar envisioned 
application. The avaUabiiity of spare parts which can be elecbically switched into 
active status allows the realization of IC packages with more functional power. 
That is, Ngher integration is attainable through automatic repair. Furthermore, this 
spare-part capability fadtitatas re-customlMtkMv reiiabiiity, and maintainability. 
For many types of circuit failure, an array can be re-programmed to reaume 
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p«rf ormanM like » perfect arr«y. TMs tUov^ fraceful difradittion of an arri^. ' 

SImpllficttien of tyttwn produetten ar^WilfittMNdir ir# lii^ via 

vfJlat^!pawat^ w(MWIW e# 'd ai lp i ^ iMl drauiiry ttial 

: ttigMkirpf<Bail-p f| i igid 4tMPt.3|fa^ii^ 

jr'iWi li l a ip h lii i i p Mr inw ^^aiipitaritaff il#%NWiyr^iipilioiiiMl If ^wte 
array tatting, cuttomiiation, and rapair via aiacMe eaMMfkation «litW1^ar 
array. Tha standard, ffiodul# ni^^ ^ it ih^ and m Amy Pngrtmrntr impHM 

. '^1 ,ii '- in aiWiaftaoiirfBttanal <r «allyiyt &1 ii i t 6iid^ d li Hl g l lc i ti l -d^lia an 

.<iaaBtrttdii|ll4jJihatii»^ aa« i »y^^ «w 

prodgw%;l<» f t iii| » r« c ia il ip yfclti^iili ^tmtMlimbMMsi^^ifi^^l^'i^^ of 
^4li|fo^3«^aittwdaNiataian#ipalidlMll«i.^'!^ 3'^^^^ ^ 
--^ ::;. .^rMmsm Kn» diiocM«#i«alM»4##ti|||,Mfl^'P "Wiir^^N^ilinla. 
. i^ »i l iM#i i M^ « iy »aaiPOflcttft-a i<aMi t i^ :#ti^«eC»« i ai a<>i »i i <^ 
>^^ li^iandti|»,^inlHri%Miiif%'«ndiaMiNil«t#$a^^ iii^lhat 

^.,tt||||MPlllMaliy^oa^pdfbNluR^ p^l^iiii;^ 

.^ilNMir«iiii^ol4ittl«3iifn^eanp^«' - -: ^.i hta? -^.jri^ios ^.-::;. r:^•..;.'■1. ',..■.-* 

v.. :Sp#.iiatefit%^4diaraelorlia#lfpir*i«iiiM^^^ 

throughout tha wafoi^ aro iiior#lNi^f^l^#ii iMl^yU'Wii Yo^y^blo 

^,^tura^;-^^wigawaafeicoiiiigBW^^tiyfty ^ ^ Thoy can 

:^««mdtrfc«nH.Jer4Matenei^ 04liiat'l>ai^icii^^ 
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conMet |Mlf^ of t^ li^rfer. 

If a waf tr cQ«^^|fMit,9n^,pg|,t (^^ 
daca-aaaa of cl>^ jrt«Wfc at^iwii^li^iiy |^^ a 

function nf «»tiw^ /...w.„>.>>^t-«^*^^,^ -tfp /mfl itilltll TIT liutll Hm U^ 

j^eld tmKi^|MHpiiiii4|}i4||«p^ mift iHwiiiMlj thummmui m liirtiiwitiiliiiL 

Y is yioid, tho ratio of mdmm^mm^m^f^HimBmaibmmm^ 

tquar* inch of .att^ A^i^mh9fM^im§i^mmm^mm'^y^ In 197?, 

B«cauf o thoro aro #^»i^^«Mi flijNilMMfilM #m« ^P H^imr 
tebricitiofv teff^ oro o^iisyM4te^ IC tyoiim. 

Wafera are dr^p^ aini bioMNi #a Ihay^^ai^^iip^ 
Perfect (^p« •« Pfi^ed iH-^iiWlii^itiri^ tgiting 

rraalabelled. (One erq^n^ told us of a p^e ^w gli^i jiitMi puttlrttt titd^1<il| tHoWil ioi 
with difforent cootools in Iho ••mtm^imm^i^ammimtm*'^^^ ww-* all 
read-only rnemorioa^ Pvloct ICa are«Mwire#^pMbioj8i^s: 

Qreat iKpfote fa lwjuR»d.#^ 
flow of matoHals. Omilta are d*ili»ad widar n^ A |irliii» 
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one is the need to rranimize pins leading from the cNp to the extra-K^ world. TNt 
allows lower packaging and lead-boncHng costs. We recopiized the need to 
minimize pins when we developed a loader that recMr^d only a few pins. TNs is 
also a major reason why our testing and repair processes require access to only a 
few pins. Because yield wkI cost depend on mar^ variables, includr^ time, thee* 
figures can only be expected to point to some rdevwt condderations in \C 
production. More detailed analysis thm is ^^propriate hare can be found in 
<Hodge8 72>, <Murphy 64>, or <Seeds 67>. 

Table Zl Chip Yield And Manufacturing CosU 

(assuming S20 slice fabrication cost and 200 defects per sq. la) 





SSI 


nsi 


LSI 


Active area (ails) 


38 K 38 


Bex 68 


128 X 120 


Chip size (nils) 


58 X 58 


88 X 88 


148 X 148 


Probe yield (percent) 


88 


58 


IS 


Chips per 2 in. dia. slice 


1288 


488 


158 


Good chips 


9^ 


288 


22 


Cost per chip 


t8.82 


t8.18 


t8.98 


Package (DIP) 


8.83 


8.84 


1.88 


Assemblu 


8.85 


8.85 


8.58 


Final test 


8.82 


8.82 


8.28 


Accumulated cost 


8.12 


8.21 


2.58 


78% yield at final test 


xl.4 


xl.4 


xl.4 


Factory cost 


t8.17 


t8.38 


$3.68 



In a March 1 975 letter to us, Hodges made the following points: Die cost hat 
declined slightly; and inexpensive plastic paclu^ng is now widely used for L9. 
Factory cost of LSI components is now in the range of SI to S4. The overall 
picture reflected in the table above hasn't changed mucK 
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Key points from the Hodges paper relevant to tNs work include: 

1) Packaging, assembly, and test costs dominate wafer fabrication 
costs for all types of ICs, even though Hodges* figures neglect pre- 
diclng test costs. However, wafer yield becomes increasingly important 
as integration level increases. 

2) The absolute cost of packaging, assembly, and testing becomes 
much Ngher as one moves to LSI. 

3) Packaging accounts for a substantial yield loss. (Hodges is 
probably oversimplifying in assuming the same package test yield for SSI 
and LSI (see <Camenzlnd 72>). LSI tends to use packn** with more 
pins, resulting in more packaging errors. However, improved assembly 
methods for LSI relative to SSI would tend to offset this.) 

Hodges makes the important point that "engineering and merketing costs 
are probably the dominant factors In the overall cost of LSI today; this will 
probably continue to be true as long as the market life of product* continues to be 
short due to rapid Innovatioa" TNs factor is partlcularly important for custom LSI, 
where the interface between the IC house and the IC user can be very clumsy 
(see <Camen2ind 72>). <Mostek 73> notes that Mostek*s charges for custom 
tooling of an IC range from 820,000 to 855,000. The exact amount depends on the 
manpower and testing demands made on Mostek. Delivery time for a small number 
of prototype units ranges from 6 to 9 months after the start of the customer's 
interface with Mostek. 
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We present a strategy for realization of large, Ngh-yield, low-pin ICs 
via automated, electronics-oriented means. Because customization and repair are 
electric processes acting on a standard IC array whose cells are designed for 
testability and repairability, costs can be low, system production can be quick, and 
the inevitable errors resulting from a plethora of con4>onents - ICs, testers, test 
programs, etc. - can be minimized 
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Section 2.3: Evolutionary Trends 

Evolutionary trends in digital systems reflect shifting realities and 
priorities important to digital systems planning. These motivate an interest in 
electronically testable, programmable, and repairable cellular arrays. Those trends 
most important to this thesis are summarized below. 

2.3. A Trend 1: Rapidly increasing capability of Integrated circuits 
Table 2.2 indicates the skyrocketing performance and complexity of 
integrated circuits (see <Altman 74>. The table assumes a rectangular chip, and a 
"device" is a transistor. Prices have dropped with the rise in IC performance. 
<Moore 74> states: "One thing that Shockley was interested In doing was making 
the 5-cent transistor. At the time, it seemed like a goal so distant it might never 
be achieved. Many people thought the dollar transistor wasn't in the cards. And 
now we sell transistors as parts of an IC at a very small fraction of a cent - 
probably 1/100 of a cent." This development comes from introduction and 
refinement of various IC technologies - for instance, bipolar, MOS, magnetic 
bubbles, and Josephson technologies. Each technology has inherent physical 
characteristics that develop as it competes with other technologies. Function per 
unit area increases as function per circuit element and circuit element per area 
increase. Furthermore, reduced defect densities allow the packaging of larger 
circuit areas. In a 1973 MIT talk. Bob Noyce of Intel observed that the trends 
toward lower defect densities and higher device densities - each changing by a 
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factor of 2 every 3 years - had helped cause the doubling of the number of 
transistors in the most complex commercially availd>le cNp every year since 1959, 
when there was a single transistor per cNp. 

This rapid Increase in IC capability is a fundamental cause for a rapid 
increase in digital system periformance (see <Turn 72> and <Kosy 72>). 

Table 2.2 IC Evolution 

Typical Industry capabi I ity 
flaKtmum clockrate (Mhz) 
Transmission banduidth (Ghz) 
Speed-pousr product (pJ) 
Comp I ex i ty 

tiaxlaium chip edge length (mil) 
Device density (mllVdevice) 
(laximum transistors per Chip 

2.aB Trend 2: Increased reliance on etectronic roeeNnes 

Rapid improvements in tC capability, combined with other factors tuch as 

rentalism (see <Toffler 70> and <Vischl 72>y have fostered highly volatile, 

competitive IC and digital systems Industries, marked by rapid product 

obsolescence and burgeoning markets. Those factors have also shifted syatem 

design priorities toward reduction of mechanical and hwnan labor by use of 

electronic devices. ICs are Increasingly used to reduce otiier costs of electronic 

systems, such as int^connection costs. 

The replacement of mechanical machines t>y ete^riieit ^ea il porvaMv* 

(see <Foss 70>, <Toffler 70>, and <McLuhan 64>). The cilculator, oath regiator, 

and watch markets provide conspicuous examples. MicrolectrOnics provide apee^ 
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size, weight, reliability, and cost advantages in these information-oriented, low- 
energy applications. 

Increasing dominance of human labor costs in total digital system costs, 
coupled with a need for shorter design and maintenance times, have spurred use 
of microelectronics to reduce labor. This is part of the society's overall move 
toward automation <Bell 72> observes that: "In contrast to technology, system 
design costs have risen; this shift is demonstrated by, for instance, the decreased 
emphasis on minimization in logic design, but on the other hand, reliability, mass 
produclbility, and maintainability are now the important design criteria." <Fran8on 
74> notes that the shortage of IC designers is forcing system designers to use 
automated help. <Hodges 72> marks the dominance of engineering and marketing 
costs in the overall cost of LSI. Labor-intensive software costs increasingly 
dominate hardware costs in digital systems. Bob Lloyd of Advanced Memory 
Systems told us of the trend in the IC industry toward silicon-Intensive, computer- 
intensive production. <Vischi 72> lists rentalism and "shortage of technical 
manpower and the increasing expense of salaries and training" as pressures for 
higher system reliability. 

Of course, the relative advantage of ICs in reducing other costs depends 
on the particular application. For example, the merits of reliability via extremely 
reliable components built into a redundancy-oriented system are much clearer In 
an aerospace application than in a commercial one. Table 2.3 gives the relative 
costs of efforts to insure various levels of IC reliability (see <Peattie 74>), The 
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categories, which i^ange from commercial to captive line, represent a spectrum 
where "the basic factors producing considerably different failure rates are several: 
the device design, the number of inline process-control inspections used, their 
level of rejection, and the degree of reliability screening." 



Table 2.3 Relative Cost Of IC Reliability Efforts 

Part Class comm. c b a capt. line 

Failure Rate (X/ieeBhrs.) .1 .85 .886 .883 .881 

Cost 1.8 1.3 1.8 2.8 4-6 



<Peattie 74> also gives Table 2.4, which demonstrates the wide range 
of reliability expenses which various applications demand. It's a table showing the 
cost of detecting and removing defective semiconductor devices at four stages in 
four types of system marl^ets. 

Table 2.4 Cost For Failure At Various System Development Stages (S) 



Market 


I ncom i ng 


Board Mount 


System Test 


Field 1 


consumer 


2 


5 


5 


58 


industrial 


4 


25 


45 


215 


mi 1 i tary 


7 


58 


128 


1888 


space 


15 


75 


388 


288n 



Demand for higher integration level is consistent with the increased 
importance of ICs in the total system. Higher system integration results in fewer 
ICs to handle, test, and store. Decreased system size and weight have obvious 
storage and transportation benefits. (For instance, many IC houses ship IC slices 
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overseas for F^w-cost packaging.) A less ^(pensive printechctrmiit or wire-wrap 
board is reqal^vd^ Usar interconnect costis, wKch may run as fi^ as 50 cMits per 
TTL gate, are rsdticed (see <Noyce 71>). "Siriise bond faituret account for up to 
55% of all IC ^fM toitorest a significant reNabilfty itnprevemer^ eau be made." (See 
<Colbourne 74».y Higher integration also reduces packi^-reb^d failures (ibid). 
Improvements' In device matching, noise immufiity, driver eh'cuitry, power 
requirements, irid transmission delay also result from higher ftilegretiorv fewer 
pacltages, and ^twer intercorvMcts. 

Timm tare also disadvantages to high integration l#vels. A higher 
number of ein»^ d«n«its per input lead miri(es a ^sten mer* difficult to test 
<Vaccaro 74> nsHm "the major drtemna facing the t»er of LS( ladi^ is simply that 
we can build mid are biNlding rnicrodrct^ts today ttwt are m(N'» complex thm we 
can adequately test, functionally or pararmtricaliy." <VJnscm 74> notes the same 
difficulty with tHe Increasing complexity of circiHt bowds. I^i^ ^tegretion levela 
have implied f#gher investments in IC cNps. For instwicet est^mi^es of the totirf 
development coiii of Intel's 8080 microprocessor are in the td^lon ddlw rmge. 
This high investment ph^es Ngh demsmds on the device's design and reliability. 
One poorly specl^sd component can resdt in large losses. Ffigh carryover from 
one IC system etes^n to its successor is necessary to reduce the staggering 
development C0«t8. For a given technology at a given time, higher integration 
implies lower yield, because of the higher probability that one of the meny 
components will be effective. 
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2.3.C Trend 3: Mass production of a few high-volume components 

The realities of LSI design, production, delivery, and maintenance argue 
for the mass-production of a few powerful, high-volume components. <Fran8on 
74> notes that Fairchild and National are emphasizing standard ICs over custom 
ones because "they get a better payoff for the engineering time and capital 
expenditure." He observes that "the big IC producers aren't after the business 
unless the volumes are high - 100,000 per year and up." Each time cumulated 
production of an IC doubles, its unit price tends to drop 30% (see <Luecke 73>). 
For high-volume devices, this is chiefly due to a learning curve. This learning 
curve, resulting from detective worl« into ways to improve production, argues for 
high-volume devices. <Vaccaro 74> cites availability and product data-base 
reasons for his conclusion that "it is clear that considerably more attention must be 
paid to selection and standardization of fewer proven device types within the 
Department of Defense." <\/inson 74> observes the maintenance difficulties 
associated with a plethora of ICs, circuit boards, and testers. He discusses 
designing for testability and maintainability, arvi suggests standardization of system 
components, test procedures, and equipment 

In his 1973 MIT talk. Bob Noyce compared the production of high- 
volume ICs to the printing of money. In a 1974 MIT Project Mac talk, Rick Dill 
agreed; and compared the production of low-volume custom ICs to choosing 
engraving over money-production. 

<Noyce 71 > observes that design of standard components with 
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capability in excess of many application needs allows the economies of mass- 
production to operate. 

In \fi9 1973 MIT talk, Noyce also demonstated the thirst of the IC 
industry for hrgh^^^ume markets. He noted that the annual sh^nping rate of micro- 
computers was^ ^ice that of all other computers. He observed that whereas the 
computer marki^ was 2 x 10* gates per year, the calculator market was 5x10* 
gates per yesr and doubling every year. He estimated the walch market at 10* 
200-gate unite per year, and the smart-phone market at 10" gates (2 x 10* 500- 
gate units) per year. Microma, a watch compmy, is now owraKi by Intel. Severd 
Intel people have told us of IntePs eag«'ness to enter the smart-phone merket 
The 10/2/74 ^»ton Globe states that "National Semiconductor said it's enterhig 
the electroniee ttmepiece market with a full Kne of quartz ##tal watches and 
solid-state al»-m docks." 

Various options are currently open to designers of systems with 
volumes too low or design times too short (20 weeks averi^e for custom - EtM 
to justify a custom IC approach Use of standard SSI and MSt^ OHnponents, with 
customization via choice of interconnection of the components, is often used for 
low-volume and/or high-speed applications. High speed comes from customizetlon 
of the system tot the task, parallelism, and Ngh-speed operators. Disadvantages 
include the large number of parts required, interconnwtion costs, and, in many 
cases, the problems of nonstandard systems weVe discussed. Another approach 
uses customization of a general-purpose conqsuter via a memory-stored program. 
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This provides lower speeds than a hard-wired device. However, it allows use of a 
small number of high-volume components. Microprocessors are increasingly 
dominating this approach. Chapter 4 presjsnts a cellular array that promises to be 
between microprocessor and custom-interconnect systems in performance and cost 
for many applications. 

Rick Dill and others have asked a key question: "What should we mass- 
produce in LSI?" MSI and SSI have readily identifiable functions suitable for them, 
such as adders, gates, flip-flops, and shift-registers. LSI is very suited to memory, 
with its regularity, low ratio of pins to area, and high volume. Bigger, faster 
memories are evolving. They may even have added power, such as associative 
memories. <Moore 73> indirectly observes the need for embodying low-volume 
systems in a high-volume IC: "We expect LSI to give us some very large building 
blocks, such as high-performance processors. Once that point is reached, we could 
go on to self-contained systems, but I question whether systems will be 
economical except in specific, high-volume applications." <Noyce 71> predicts that 
in 1980 a 100,000 gate "superstar computer's needs could be satisfied with about 
10 bipolar LSI chips. However, even if the capability exists to put all of a 
computer's logic on a few chips, this doesn't mean we'll be able to find a practical 
design philosophy that will permit us to do it - and that brings us back to the 
whole interconnection problem again." That is, interconnects between IC 
components can consume large amounts of design time and chip area (80% Is a 
good guess of percentage of chip area devoted to interconnects for current 
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systems - <luecke 73>). furthermore, the fact that several layers of metallization 
are normally meociated with a higher level of integration Is a prime contributor to 
low yield and reliability. These considerations lead us neturetiy into the next 
evolutlon«y bwid. 

2.^& Trend 4: Increasing Regularity 

Th« trend towffixl component and system reguiwity is evidmt Oigitai 
systems use far more memory bits, wNd> are orgwiized in a regular fMhion, tluHi 
less regular combinational logic elements (see <Luecke 73>). Functions 
traditionally pei^rmed by "random logic" are increasingly performed by arrays. 
For instance, microprograms embedded in memory have talcen over many 
computers' control functions, which were formerly done by random logic. 
Programmable Logic Arrays, extensions of read-only memories that allow use of a 
regular array to realize complex combinational logic functions, also evidence tNs 
trend. Increas«l interest in array w^tectures evidwiee this trmd on a t«^r 
system level. 

There are many advantages to regularity. Iteration of a simple 
component, or cell. Is consisent with mass-productloa It allows concentration of 
system efforts on • component that can be optimized, rather than distributed effort 
over a collection of equally important, different components. Simplicity of a celPs 
environment allows concentration of effort on design of the celt for that 
environment A regular environment is easier to understand and test, both for men 
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and computers. "Repetitive layout contributes to the realization of high circuit 
density per chip." (See Carr 72>.). Regularity implies constraints on metallization 
paths, which usually implies less crossover and crosstalk problems. As a 
technology improves, a more thorough carryover of investment is possible between 
designs for a regular machine. (<Lathrop 70> makes many of these points.) 

These generalities are supported by the rapid evolution of memories. 
Their regularity allows concentration on optimization of the memory cell (RAM, 
ROM, shift-register, etc.). This optimization is helped by the identical environment 
of all memory cells internal to a memory array. For instance, the impedance a 
circuit drives can be standardized and well-controlled. Once one understands how 
one cell works, it is easy to understand the entire array. Production of the array 
can be performed by replication of one of its cells. A memory array is relatively 
easy to test. Memories are so common and fundamental that every designer 
knows their important parameters. Should technology improve (by, for instance, 
improvements in yield and reduction in cell size), a basic memory cell design is 
easily adapted to a larger, denser chip. The transition from one memory array to a 
larger one is relatively easy compared to, for instance, the transition from one 
microprocessor to a larger one. 
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Section 2.4: Trends And Arrays 

Tfm advantages that make celiuiar arrays interesting derive from their 
nature as an iteration of functionally, and usually physically, Ictentical components. 
We*ve alreac^ explored the motivation for the trends towerd mass-produced 
standard parts and regularity. These reasons, appHcakrie to cellular arrays, are 
summarized kMiefw. 

Advanti^;** of mass-production of a standard part include: 

1 ) High en^neering and marketing efforts 

e»q>ended on a Ngfi-vcrttmw p«i 

2) i^i^r uufornwty in prockction |M'oce«Bes 
^Mlgher availability 

4) Reckiced inventories 

m M#e thoroughly characterized device, wNch hnptot 

a) leaming-curve-related jnp'ovements In 

diili^ and manu^ttture, and 

b) expanded data ba|e for reiiablUty and faUure-roode predletlona 

Advantages of regularity include: 

1} AR advantages of a mass-produced standard part 
^ Siraptififid connections 

s^ easier to design and produce 

b) Ngher component density via reduced interconnect areas 
9 Controlted, simplified environment 

a) iterated element can be ^timized for tNs enyironroent 

H easier system to understand, test, and maintab 

4) Performance increase via numerical component increase, 

implying the ibtlity to incrementaliy add performance 

5) Greater carryover betwem de«p» 

When cellular arrays are used for realization of combinational logic or 
sequential machines, they have added benefits. Controlled customizetlon of a 
flexible, regular, mass-produced devke imii^les more stages of mass-production 
and a faster product development time them for custmn realization of an irregular 
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i 
special-purpose machine. Furthermore, arrays can provide higher parallelism than 

single-sequence computers used for the same function. 

The highly structured nature of cellular arrays also has inherent 
disadvantages. When an array is customized to behave like a particular machine, 
more components are involved than for a custom-built machine. At any time, the 
cellular array has a lower density of active components than the custom machine. 
Furthermore, the array has more parts that can fail. Of course, component costs 
are decreasingly important in digital systems. If the cellular array can be easily 
re-customized (if it's programmable, as ours is), this added capability cannot be 
considered as useless overhead The enormous overhead of a general-purpose 
computer processing one instruction at a time is acceptable because of its 
flexibility, generality, and computational power. Use of the simple structure of a 
regular array to facilitate test and repair ameliorates the increased probability of a 
component failure. 

A second general disadvantage of cellular arrays is "an increase in the 
length of wire through which a signal must propagate in comparison with 
conventional logic not possessing stringent interconnect pattern limitations." (See 
<Spandorfer 68> and <Hu 73>.) Again, the enormous inter-operation delays 
associated with single-sequence computer makes this problem seem less severe 
for programmable cellular arrays. 

<Mukhopadhyay 71 > mentions another problem associated with cellular 
arrays: 
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"Anther major problem in cellular logic is standardization. We now 
have at hand a technology which can prockjce arri^ of cells of vmy 
lari^ complexity, but we do not yet know how to tise these devices 
ef6{^«f% in FN'actical designs. This is because of the rapid growth of 
the ntymber of logic functions of a cell with the number of 
inputs^mitputs, so that when a cell has more than 4 or 5 inputs, one 
simp|y^«loes not know what to put into tfw c#H in m4m to obtain a cell 
thi^ may be widely used in digital circuits." 
This point is ewMenced by the plethora of ceti-deslgns proposed 

This ttiesfs addresses this problem by identlfyifig and satisfying 

important recMrements on a cellular array. We develop nraehemams for automatic 

test, loading, and repair of cellular arrays. Various ceHular arfays Incorporating 

this machinery are presented. These include memory arrays, and an array capable 

of supporting universal coroputer-constructor-repairers. We show practical 

advantages of these cell designs, relative to other designs, which argue for their 

realization. One such advantage is that these standard arrays can be electrically 

customized to a wide range of customer needs. Realization of and experimentation 

with a good cell design would clarify many of the issues involved with cellular 

arrays; a better understancfing of the most important parameters wodd result 

<Mukhopadhyay 71> mentions another difficulty that he sees. 

"A cKfficult problem arises when one has to locate and correct a faulty 

cell in an array. Since the number of test points or the input/output 
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pins available is very small compared to what can be expected of a 

circuit of similar complexity built with lumped components, the 

difficulties in diagnosis problems have just been compounded. It seems 

unlikely that good and practically manageable algorithmic solutions to 

this problem will be developed In the near future because of what 

seems to be an inherent contradiction in the objective: programmablllty 

and flexibility which can be achieved by increasing the cell complexity. 

This implies an exponential growth of fault types and correspondingly 

astronomical growth of the number of tests to be applied to a very 

limited number of test points. The logic designer of cellular arrays will 

have to accept a certain amount of failure in the circuit and will have to 

invent synthesis procedures for fault-tolerant circuits." 

We believe that this thesis refutes this argument. We present simple mechanisms 

which enable test and repair of an array. Particular machines which incorporate 

these mechanisms are developed. Furthermore, many existing array designs may 

be modified to incorporate our test, configuration, and repair mechanisms. 

Mukhopadhyay's argument seems to argue, by implication, that large random-access 

memories are untestable. We assume that a cell's behavior depends only on its 

state and inputs, and not on the state of another cell. We try to justify this 

assumption by constraining our design; for instance, our design has no signal 

busses connecting distant cells. Our independence assumption is analogous to 

testing assumptions made for current integrated circuits, including random-access 
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memories. TMst tndepWKlence assuff^tion eni^es specification of simple function 
states to alfow individuiri test of an array*8 ceils via leads attadwd to a few cells 
anywhere in ^ie array. Leads to one side of one ceil are suffi^ent to allow the 
repair of mo^ wrays. Our technlqiKM irflcnw t)M v«rification &I our independence 
assumption by testing a complete embedded machine. Complete arm and tree 
machines may be easily tested via their inputs and outputs. Test links may 
connect intenwl p^ts in a hii^rs^con madtine to cdls at the «igB of an array. 

FiniAy, mar^ have observed that truly powerful arrays need many cells. 
Current reattliee eA IC f alM'icatim therefore sus^est repairable odlular nrays as a 
means for ad^evn^ large arrays. As densities Bndi yidds in^e^ builcfing a little 
circuitry Into a osHUn' wrrvy to hdp testing and r^Mrir beccmtes more appropriate 
We fociM xm mimM^ elecb'ortic r^Miir becMjwe of Hs IncrMMNg ettrectivenesa. 
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Section 2.5: Testing And Repair 

2.5.A Non-cellular 

The increasing difficulty of testing digital systems was noted above. 
Reasons include the plethora of digital systems and testers, their increasing 
complexity, and the decreasing ratio of test points to circuitry. At one time digital 
ICs could be tested by monitoring their output for each possible combination of 
input and internal state; Ed Fredkin recalls when many engineers insisted ICs 
required at least this much testing. This is currently impossible for most ICs, due 
to the astronomical time for such a test (2'°* tests for a serial-in, serial-out 100- 
bit shift-register). Consequently, less ambitious approaches are now used. 
Common test sequences assume one logic element's performance is independent of 
the performance of other elements in the circuit A gate may then be tested by 
putting its IC into a state in which that gate's output, a function of Its varied 
inputs, affects the output of the IC. This approach is especially useful in go - no 
go testing of ICs, where one only wants to know whether a circuit meets its 
specifications, and not the causes of faulty performance. Most procedures that 
must locate faults, such as maintenance procedures, assume a single fault (see 
<Marinos 71>). Because of feedback paths, sequential machines are particularly 
difficult to test. 

Development of a new testing program for each new IC or digital system 
is increasingly distressing. This has caused many to ask for a system-oriented 
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testing approach designed into digital systems (see <Vaccaro 74> and <Kautz 68>). 

Faut^t^hendling techraques typically use systems composed of a large 
number of comf>onents, some of which are defective. Two fm|or fault-handling 
techniques mrnrnptsk and fai^t-tolernwe. 

Ri^fimr techniques are characterized by fault-detection, followed by 
some form of ^^i^assing of faulty components} so the system's output then 
depends onl^^m the output of its good components. The best-known repair 
techruque for dlgNtd ^sterns inv^ves detection of a faiity component In a system 
designed to ^latode only good coroponer^s; a human then vi^stitutes a good 
component for Ihe fautty i^n^jonent We focus h«« on i^iiiraaches wNch» like 
oLTSi allow (kthg^^m coir^Hxients to be asso<»ated with a work^( systwn. 

FayH-4oler«vt hardware techniques «m at premier hardware performance 
in spite of f^H^t« which occur during the hardwere*s opm^tion. Standby 
redundancy "employs several functionally identical modules, some being used 
actively to perform the function, the remainder walttng to be switched in •'>ould 
one of the a^inw modules fail." <See <Carter 70>.) MaiAd«t redundancy "is 
applied to tedvriqiMW which Involve encoding funcUon, active pwfw^rwnce by all 
parts of the sys^Mn, and in^dt riKEOgnition of error." (ttricQ 

WHle Dur approach is fundmMmtidty a re^r afH>roach, it Is compatible 
with realization of fauit-toterant machines. Cells can act either as active 
compcNients or as 8t«ictt>y parts in a stan(ft>y redundancy system. The mechanism 
for switching cellular standby parts into active status is built into the cells of an 



PAGE 80 

array. Machines using masking redundancy can be embedded in an array. Chapter 
4 presents a cellular array consistent with such a hybrid, fault-tolerant system. 
Two machines in such an array can monitor each other. If one machine fails, the 
other can test the failed machine's subarray; and embed a new, perfect machine In 
that array. 

<Spandorfer 68> describes discretionary wiring as one repair technique 

designed to provide large, high-yield ICs: 

"The number of basic components fabricated on a large semiconductor 
chip is larger than that needed for the desired final circuit. Each 
component has associated bond pads, which are probed during testing. 
The location of good components is used by a computer to generate 
patterns which wire only the good components Into the final circuit . . 
. Arrays make use of two metal and two Insulation masks." 
The fact that the considerable difficulties inherent In this approach were 

even attempted points up the desirability of high-yielding high-Integration ICs. 

<Foss 70 mentions some of discretionary wiring's difficulties: 

"The layout of a discretionary wiring cell array is made very inefficient 
by redundancy and the need to allow probe testing of the individual 
cells. Although the 'yield problem' of the logic cells is eased, the 
approach still requires '100% yield' on the subsequent dielectric 
deposition and metallization processes. (These are normally the lowest- 
yield steps in wafer fabrication using two metallization steps. - FM) As 



thestt Tmjst be fauitlese over a tmjch greatw wea ttwi needed for non- 

drscrciflonary cetl interconnection, it may well be found that the yield 

problems of the two approaches (drscrefUenary ^ririhg and conventional) 

aretMt cRv^fiMar." 

<Spandorfer 6^ remarks that "the key problem is the criticid di^p>endence on data 

processing for wmk layout, albeit off-tine, and a host of othM* production control 

routines per copy. Further, use of a unique mask implies « relatively slow 

production procttss for a given capital investment." In edition, each prochjct has 

unique inteixi»«i»c^on and dynanwc characteristics." (See <K^vin 67>.) These 

reasons indicate ^^ discretionary wiring is now gwteritfty coiwdered a bed wi^ 

to acNeve hi|^ Megratitm. 

Other owkm wriring teehraques for ui^tq; fai4ty comyMMei^s have been 
proposed. <T«TMDaru 67> describes i»e of customized board wiring to allow use 
of faulty ICs. 

< Sanders 72> describes a method that involved color-code 
categorization ^ prtly-i^od mmory con^onents and thdr ^ Aoaqu wt inttatlation 
in a standard cdcr-keyed circuit board. These boards used logic that transformed 
Incoming addrettM to addresses of good memory words. Texas Instruments 
expects to use a similar method in late 1 975 for realization of perfect bubble 
memories from ttir^erfect components. The major dfficulties with these two 
techniques are the handling of multiple part types and the need for defect 
statistics that are firfrly consistent as time passes. 
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Unlike our approach, those above require perfect Inter-component 
wiring. Furthermore, human and mechanical intervention are necessary to repair a 
system that develops a fault after the system is initially fabricated. 

2.5.B Cellular Arrays 

Cellular arrays have several potential testing and repair advantages. 

<Kautz 67> notes: 

"One would naturally expect that the iterative structure and the short 
internal connections of a cellular logic array would allow it to be tested 
from its edge terminals much more easily than a relatively disorganized 
interconnection of the same number of gates. If test signals are able to 
pierce the first one or two cells at the edge of the array, they can 
. probably be arranged to pierce arbitrarily deeply. Similarly, if an error 
due to a fault can be made to pass through one cell toward the output 
terminals, there is a good chance it can pass the entire route. In 
addition, the regularity and shortness of connections in the array tend to 
support the convenient assumption that the array as a whole is fault- 
free if each individual cell can be shown to be fault-free." 

One might also expect iterative array to be amenable to simple, iteretive test 

procedures (see <Thurber 69>). 

<Kautz 67>, <Menon 71>, <Tammaru 69>, and <Seth 69> study cutpolnt- 

connected combinational arrays in which the output of each cell is a fixed function 
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of only that ceif $ inputs. These papers concentrate on array testability and 
diagnosability: Qw capability, method, and time to test for, and preferebly locate, 
faults of an tmmxmd rattre, via inputs and mi^Mits at the ed|e of an array. A 
common assumi^en is that atl input combinations to a cell m»t be included in 
testing of that cell (Kautz, Seth), or at least that each gate In a ceH mutt be teeted 
(Menon). 

Our i^proach is quite different We treat arrays of more complicated 
cells - checkerbewd «Tays of progratwnaMe i(^ 9nce we dont care ebout a 
cell's response to state-input combinations It woift encounter durbtg Its operation, 
testing and repair are facilitated An edge ceH with floattng inptils or outputs need 
not have these Hnes connoted to a test nMcNra. Instead of mkisng for necessery 
and sufficient eenctfticMis on array testakMiity or cftapiesirt»iHly, we construct 
modules that cmvert a givm wray to one tlwt Is ei^y tested and repwred by 
progranw we cbso'ibe. 

<Yau 70> also treats 2-dimensional cu^nt-conneeled combinetional 
logic arrays. The paper presents effksiant nwthoete for edcft^ k^ md tenraniAs 
to each cell of an array to mai^e the resultant, modified array diegnosable, and for 
deriving test s<^fediies for it. The logic and termlnais added depend on the 
origind ckisigrH they are not strndwd Test signals «m routed to md irom the 
edges of the array. Repair is accomplished by shorting or opening metal 
interconnections. 

<Spandorfer 65> describes repair of two-dimensional celMar arrays by 
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use of computer-determined wiring patterns which preserve the two-dimensional 
topology of the array. Custom metallization, which may pass over flawed cells, is 
used to convert a flawed array into a smaller, perfect array by proper connection 
of perfect cells. This approach assumes the interconnection network of a perfect 
array must be preserved; it does not consider the class of a machine embedded in 
the flawed array. 

<Minnick 66> describes use of custom metallization for repair of various 
arrays. He also discusses the efficiency of associated repair strategies for 
cutpoint arrays. 

The closest precursor of our testing and repair approach seems to be 
<Kukreja 73>. Testing a cell in a particular state involves including each of its 
inputs and outputs in a signal path to an edge input or output. Test signals to and 
from a tested cell are carried by cells in transmission states. Repair comes from 
programming cells in a row or column containing a faulty cell to behave like wires 
linking good cells (see figure 2.2). 

Our test and repair of arrays embedding high-relcon machines similarly 
involves use of cells in transmission states. However, there are key differences 
between our arrays and Kukreja's. Kukreja's 2-dimensional array is a cutpoint- 
connected array of simpler cells, in which each cell receives control variables on 
lines from a third dimension. Cells are not programmable logic cells, so there is no 
loader. Sequential machines are realized by a 3-dimensional stack of 2- 
dimensional arrays. Testing requires direct connection between a test machine and 
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Fig, 2.2 Rpogrzunmed Array- Repair 
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all the edge cells of an array. Kukreja's approach requires far more cells and 
extra-array connections for testing of arrays, and for realization of most machinaa. 
Kukreja's repair approach is what we call "simple repair"} our repair procedure is 
more complicated, but also more efficient for most checkerboard arrays. Kukreja*8 
emphasis is quite different. He does not address many of the design, testing, and 
repair issues we address. 

In sum, our approach is the first one we've seen that details LSI- 
oriented circuit modules and describes associated software for low-cost, 
automated, electrical testing, repair, and customization of cellular arrays. It is a 
systems approach whose advantages will become clearer as its description 
becomes more specific in the following chapters. 
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CHAPTER 3: ARRAY-EMBEODEO AI^S 

Section 3.0: tntraduction 

Thif iilifliDter presents several examples of macMms thet are embedded 
as arms. Since any one of a large set of ioadyng arms m^ be grown and retracted 
by loading sifpi^ input to one side-set of one arm base cell, fietdtde loadng can 
occur in a flowsd array. Processing-layer arm macHnes, wNch we composed of 
balanced cell% ean be graduelty grown and tested, and snaked around flaws in an 
array. We focye on the c^t mechanisms and support programs that provide these 
capabilities. Fori9>ecificity and practicality, we concenbate on the reirfization of 
highly integrated, length-programmable, computer-repairable shift-registers. 
However, our technique* apply to other arm machines. Theee techniques are 
easily generalized to tree and htgh-relcon macNnes. 

We present a mono-active, balanced loading medwnism for growth of 
loading arms. The loading inputs of any side-set S of a cell we sufficient to load 
that cell's loadlr md function-specification state bits. After the ceil is loaded, it* 
loader state bits may specify which of the celPs neighbors, if any, receives loading 
information funneled through the celt from side-set S. The loader's balance allows 
an arm to funnel the same command to a cell independent of the path the arm 
takes in reaching the cell. Optional cell modules extend the loader's capability. 
The loader state may specify that an arm's tip be re-loaded, or than an arm 
incrementally retr«:t A brief signal to the base of an arm can cause the erm to 
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totally retract. We use this same loading mechanism throughout our work. A 
mechanism with its capabilities is easily incorporated in arrays of two or more 
dimensions. 

The most interesting machines that can be embedded In the processing 
layer of this chapter's arrays are arm machines. Embedding is particularly easy for 
these machines. Typically, the loading and processing arms grow together through 
the same cells. The Array Programmer adds one cell at a time to these arms; and 
tests the new, extended arms after each extension. The Array Programmer only 
communicates directly with the processing inputs and outputs of one side-set of an 
arm's base cell. When the arms encounter a flawed cell, they may be partially or 
completely retracted, and grown through different cells In the array. An arm may 
be grown in any direction, avoiding flawed cells, because of the loading arm's 
flexibility and the fact that an arm of balanced cells is grown in the processing 
layer. Our description of the testing and repair processes depends on a model of 
flawed cells' behavior, which we state and analyze. We study repair efficiency 
through a program that simulates repair, and suggest techniques to improve 
efficiency. We consider other issues relevant to the practical implementation of 
our arrays. 

Repair through the interwoven processes of arm growth and testing 
contrasts to repair of high-relcon arrays. Since the requirements on the 
communication paths between essential cells of a high-relcon embedded machine 
are more stringent, repair efficiency is enhanced by the location of all the flaws In 
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an array before a global determination of a good way to repair the array. Repair 
efficiency thervfore dictates that a test procedure Is limited In its ability to 
predict the ro1« a given cell will play in an en^Mdded machtne. No matter how 
large an arm flMCiilne grows, its \nf»A» and outputs are riways tA one side-set of 
its base ceil. Mn^oven processes of mad^ne growth Mid testing are hanqsered 
when a machlrie*s growth implies a sometimes-growing number of inputs and 
outputs: the nartoer of connectlwts between a test macNne and a partlirily grown 
embedded madvne is variidaie, and may be iarga These conwderations encourege 
us to test «M ^ cdts in a hi|^r^«i array iwfore repairir^ that amy. Howbvw, 
the test and ri^sair processes for high-relcon machines use the same loader 
described in ti^ chapter, and fataimced procesting transnisslon states similar to 
the balanced states in this chapter. Many practical implementation issues are 
similar. Thus this chapter is useful in itself, and as a bridge to tl» next chi^ter. 

Since a tree madiine may be endraddc^ as m arm, ifs not surprising 
that the approadi described for arm macNnes is reacfily adapted to tree roadilnes. 
Because a tree macNrw may be realized by an en^aedded madine ^th any tree- 
like reicon network, including an arm network, an array en^edding a tree macNne 
is particularly easy to test and repair. Embedding a tree or arm machine ie 
considerably easier than embedding a Ngh-relcon macNne. 
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Section 3.1: The Loader 

In programmable logic, a loading mechanism is used to load the function- 
specification state bits in a cell. Others have proposed loading mechanisms 
incorporating long, fixed, irredundant signal paths routing loading information to a 
given ceil. These loading mechanism have major limitations, including susceptibility 
to catastrophic failure due to destruction of a long, critical loading line. We 
propose a method which incorporates extra logic elements in each cell to allow the 
flexible growth of a loading arm in an array. A loading arm is composed of cells in 
proper loading states, and not long lines. Since an arm may be grown from any 
cell, an entire array can be loaded via inputs to one cell in the array. 

Figure 3.1 illustrates two common methods for loading the function- 
specification state bits contained in a shift-register in each cell. Figure 3.6 
describes a shift-register's operation. Snake and Crisscross use parallel-out shift- 
registers, with each output acting as a function-specification state bit connected to 
the processing layer. The shift-registers in Snake and Crisscross are parallel-out 
shift-registers, with each output acting as a function-specification state bit 
connected to the processing layer. These connections are not shown in figure 3.1. 
In Snake, each shift-register is part of a long shift-register that snakes through all 
the cells of the array (see <Spandorfer 65>). (If each function register is 
associated with one cell, cells in different rows have different loading inputs and 
outputs; the array is strictly cellular only if we conceptually group mini "cells" into 
a macro cell.). In Crisscross, one of Wahlstrom's methods, each cell is associated 
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Fig. 3.1 Two Common Progranmable Logic Loading Mechanisms 
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with a unique ciocl^line, dateline pair (see <Shoup 70> and <Wahl8trom 69>). Each 
clockline extends through a column of cells, and each dateline extends through a 
row of cells. Co-column cells must therefore be loaded simultaneously. 

Note that these methods could operate on a more general type of array. 
For instance, function shift-registers in different cells could have different lengths, 
drive different circuitry, etc. That is, the key idea is that a loading mechanism is 
iterated through the array. The loading techniques we describe are also useful In 
this type of array, if it has two or more dimensions. 

Like the loading methods of figure 3.1, our loading method loads 
function-specification state bits into a shift-register. However, our loading method 
uses logic elements in each cell to allow loading information input to any cell to be 
routed to any other array cell that is not walled off by faulty cells. Loading inputs 
set a cell's function-specification state bits and loader bits. The loader bits 
specify how subsequent loading information input to the cell is to be handled. 
They may, for instance, specify that it is to be routed to some neighbor-cell. 
Consequently, loading information to a cell may be routed to any cell In the array 
by a loading path, or arm, of cells in appropriate loader states (see figure 3.2). 
Loading signals input to the base of the arm can load the tip of the arm, extend 
the arm from its tip, or retract the arm. Proper use of a perfect array's loading 
mechanism only allows the embedding of arms in the loader layer of the array. 

Our flexible loading arm has several adventeges compared to the 
loaders of Figure 3.1: 
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Fig. 3.2 A Loading Arm Grown By Array Progranimer Signals 

arm's base 



Array 
Progrsuramer 



^ 



arm's tip 



PAGE 94 

1) The methods of Figure 3.1 depend on long, inflexible signal paths. 
Each cell can be loaded in only one way, so the cell is useless if that 
way doesn't work. A signal path connecting many cells is a weak link In 
terms of repairabilityj its destruction severely limits the usefulness of 
the array. Furthermore load impedance, noise, and delay considerations 
make long lines undesirable. Our loading method does not require any 
long signal lines. In some technologies, such as magnetic bubbles, all 
long lines, including power supply lines, can be eliminated from the 
array. 

2) The other methods require that many cells be loaded 
simultaneously, even if one only wants to load one. For Snake, this 
requires the reloading of an entire array even when one wants to 
change the state of just one shift-register in the array. For Crisscross, 
this requires the reloading of an entire columa In our approach, loading 
cell A from cell B requires a loading arm from A to B. If no arm already 
exists, only the cells on some path between A and B need reloading. 
Once a loading arm is formed, its tip can be easily moved around. This 
is particularly attractive because two successively loaded cells are 
usually close to each other. 

3) Crisscross requires a loader input to the array for each row and 
each column of the array. Large arrays consequently require a large 
number of input pins and associated connections. Recognizing this 
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deficiency, <Wahistrom 6^> describes an extension of Crisscross. In 
this extension, a ceil can enter a state in whicli a processing input Is 
transmitted to a dataline above it or a clockline to Its right This allows 
loading of an arbitrary cell In an array via processing and loader inputs 
to a cell in the lower-left corner of the array. Wahlstrom admits that 
such loading is indirect and slow. Its utility is severely restricted if the 
lower-left corner of the array is faulty. Our method allows speedy 
loading of an arbitrary cell with the three loader inputs of a side-set of 
any one cell in the array. This implies that connecting the loading 
outputs of some cell in one array to the loading inputs of some cell in 
another array allows a loading arm to be extended from the first array 
into the second array. This minimizes the number of pins required for 
testing, loading, and repair in systems composed of several ICs. 

4) For all the loading methods, one can envision a function state in 
which a cell's processing inputs control loading lines near the cell. (A 
macMne in Chapter 4 uses such a state to allow a machine embedded in 
an array to test its environment, and to construct and repair machines in 
that environment.) For the other methods, there are harsh limits on the 
position and number of cells that can be loaded from such a cell, even in 
a perfect array. With our method, any cell can be loaded from any 
other cell that is not walled off by flawed cells. 

5) A loading arm's flexibility allows it to avoid flaws in a faulty array. 
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6) Our method allows use of several loading arms simultaneously 
loading unrelated cells. Because the other methods involve loading lines 
extending through many cells, they do not allow this. 

Our loader demands a small number of additional logic elements In each 
cell to achieve its advantages, but the cost of logic elements is declining rapidly 
compared to other system costs. 

A cell's loading mechanism allows the loading of function-specification 
state bits in the cell. This mechanism consists of a Basic Loader, which is usually 
combined with one or more loader options. The site of loading activity in an array 
is the tip of a loading arm. The Basic Loader allows the extension of an arm to 
include any of the tip's neighbors which aren't already in a loading arm. A Total 
Retractor option allows the rapid destruction of an arm by a single signal to the 
base of the arm. An Incremental Retractor option causes a tip's relcon neighbor in 
a loading arm to be the new tip; the loading arm is then incrementally retracted 
The Tip Changer option allows a tip cell to be repetitively loaded. The loader 
options demand extra logic elements, but extend the power of the loading arra 

Each cell in a checkerboard array has Select, Clock, and Data loader 
inputs and outputs at each of the cell's four side-sets. When an array's power Is 
turned on, its working cells are inititalized so that all their Select output lines are 
low. Raising a side-set SS,'s Select input activates SSt: Data and Clock inputs at 
SS, may load the cell's register containing its function-specification and loader 
state bits. The newly Selected cell is the tip of some loading arm. A counter in 
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this tip cell counts the number of bits shifted into the cell's register after 
activation; so the cell knows when its register has been loaded. The loader state 
then specifies the new arm tip. With the Basic Loader, one of the loaded ceirs 
interconnection neighbors that isn't in an arm may have its Select input at side-set 
SS2 raised. Then Clock and Data information from the base of the loading arm flow 
through the arm, through the former tip, to the new tip. This process may Iterate. 
Loader optiorts extend an Array Programmer's ability to control a loading arm. 

Figures 3.3 through 3.5 give an embodiment of our loading mechanism In 
a checkerboard cellular array. Most discussion of the loading mechanism is on a 
functional level. The loader can therefore be understood without reference to 
these diagrams, but they are included for specificity. Figure 3.3 shows the 
mnemonically-initiallzed names of the loading inputs and outputs of a cell. The 
loader lines are Select, Clock, and Data. Up, Right, Down, and Left refer to the 
cell's four side-sets. Figure 3.4 shows one possible realization for the Pulser In 
the lower-left corner of figure 3.5. After power is supplied to an array, the 
pulser's OUT line in each working cell remains low long enough to assure that all 
appropriate memory elements are simultaneously reset Intel's microprocessors 
have a circuit with this effect. The other elements in figure 3.5 - a complete 
functional diagram of the loader for one cell - are familiar standard logic elements 
like those in a Texas Instruments TTL catalog. The function of each logic element 
is summarized in the symbol table in figure 3.6. These elements could be realized 
in many forms and technologies. 
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Fig. 3.3 Input-output Lines Of A Cell's Loading Mechanism 
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Fig. 3.4 T?he Loading Mechanism's Pulser 
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Fig. 3.5 Loading Mechanism With Options 
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Options are marked by - - - boxes. Not using an option implies 
elimination of associated elements. if the resultant circuit 
has an AND gate with only 1 input, that gate is replaced by a 
wxre connecting that input and the output. 



Fig. 3.6 Symbol Table 
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(1st of 2 pages) 



A) Combinational logic elements 
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B) Pulse-maker 

Although combinational logic elements ideally act instantaneously, actual 
devices involve a slight delay before a change in the inputs Is reflected at the 
outputs. This delay is used in the pulse-mailer. 
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Fig. 3.6 Symbol Table 
C) Memory elements 



(2nd of 2 pages) 



Jilways indicates a DATA input. 

C alwj^s indicates a CUX^K input for a memory element. The memory 

elem^l re^ionds ^ aj»09itiye<«going transition on Cs input 

f^ always indicates a RC^T in|»jti aU outputs of the memory element 

areOifR«l. 

Q always ifldteates an OUTPLTT of a memory-eleroent 




FLIP-FIC^. Q takes the value of D 
when e poeltiyA-^foif^ C tranaitlon 
ocars. 



D 
C 



QO Ql Q(m-l) 

S-R (ra) 



SHIFT-REGISTER <m-bit, serial-in, 
pwaitel-Mt)^. ^bjtputs are QO through 
Q<m-1). 



QJ. QCm- l) 

— ♦- tS rt — 




This is «|uivalent to tte shift-register 
above. 



SHIFT-REGISTER (m-bit, serial-in, serial- 
out). Only Q(ro-1). the test bit of the 
sNft-refister, is output 





1 


1 • • • 1 




QO 


Ql Q(m-l) 


c 




CTR (m) 



COUNTER (m-bit). This counts in binary, 
changing state on positive-going 
transitions of G. If m > 2, the counter 
has the state-transitions for {QO Ql) of: 
(0 0) to (1 0) to (0 1) to (1 1) to (0 0). 
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We first detail the Basic Loader, with none of the loader options. S-R 
FL is the shift-register containing the function-specification and loader state bits. 
LOO and LOl are the loader state bits that specify the loader's output side-set 
LSTA is only included when the Tip Changer is used; this loader state bit specifies 
whether a tip cell is to be re-loaded. S-r FL may have any number of function- 
specification state bits. Here we assume four such bits - INO, INI, OUTO, and 
OUTl. CTR, a counter, counts the number of bits shifted Into FL after a cell Is 
activated. Since CTR must be able to count to P, the number of bits in FL, CTR Is 
logjP bits. The P-detector outputs a 1 when CTR's count is P. For P •= 6, the P- 
detector performs the function OUT = B2 AND Bl AND NOT BO = 6, in binary. 
TCH, the "touch" flip-flop, signals that a cell has been loaded. 

The Basic Loader is used to load a perfect array in the following way. 
When power is supplied to the array, the Pulser resets CTR and TCH to 0. The 
CTR, the P-detector, and the TCH flip-flop are used to determine when a cell's 
shift-register FL has been loaded S-r FL is in an indeterminate state (although 
some processing layers require it to be pulser-resettable; this forces all cells into 
the same function state when power is turned on). All extra-array inputs to the 
array are 0. Assume that S.LIN for some cell A goes from to 1. Cell A has been 
TOUCHED from the left, and now its left loading inputs are ACTIVATED. CLIN and 
D.L.IN may now affect the cell's function-specification and loader state bits and its 
loader outputs. This prepares shift-register FL of cell A to be loaded via D.LIN 
and C.LIN. Since all other S.INs are 0, all other side-set's loader inputs are not 
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activated. D.LHSI is relayed to D.OUT and C.LIN is relayed to C.OUT. Besides being 
the D outputs of the cell, D.OUT Is the D input to shift-register FL Since TCH-0, 
C.OUT is the C input to sNft-register FL and CTR. The first positive transition of 
CLIN causes 

1) D.LIN to be shifted into shift-register Fl^ and 

2) CTR to be incremented to (BO Bl B2)m 0). 

During loading of the cell, CTR functions to count the number of positive C.iN 
transitions since the cell was touched That is, it counts the number of bits shifted 
into shift-register FL Succeeding CLIN positive-transitions will similarly shift 
information into shift-register FL and increment CTR. The "p"th such transition 
(6th in this example) causes 

1) the 6th CLIN bit to be shifted into shift-register FL, so that all the 
information in shift-register FL has been loaded from CLIN since S.LiN 
went high; and 

2) CTR=^(0 1 1); this causes the output of the P-detector to go high. 
Thus shift-register FL has been loaded with function-specification and loader state 
bits; the P-detector signals this fact by sending a high signal to the input to the D 
flip-flop. When CLIN next goes from high to low, TCH goes high. This causes 

1) the C inputs of shift-register FL and CTR to remain low; C.OUT is no 
longer transferred to them; and 

2) one and only one S.OUT to go high, thereby activating Inputs at the 
side-set of some "touched" neighbor cell. The one selected is 
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determined by LOO and LOl. 

The loading arm is a loading-signal path starting with some base ceil 
with a high S.IN, and possibly extending from that cell to other cells, with the arm's 
path marked by high S lines linking neighboring cells. Figure 3.2 showed one such 
loading arm. With the Basic Loader we restrict a cell from touching a cell that is in 
a loading arm. In this case, this means cell A should not touch left, the source of 
loading information. It may touch cells that are up, right, or down neighbors. 
Assume cell A touches cell B above cell A. We then say that the loading arm's TIP 
has been moved up from A to B. This is caused by loading cell A with L00«L01«O, 
when cell A's TCH goes high, its S.U.OUT goes higK That is, cell B is then touched 
by cell A. Because S.L.IN is still cell A's only high S.IN, it's still true for cell A that 
C.OUT=C.LIN, and D.OUT=D.L.IN. Because cell B is the only cell that A is touching, 
cell B is the only neighbor of cell A to accept C and D Information from A. B can 
now be loaded in the same way that A was. B may then touch some new neighbor 
FL, funnel loading information to this new tip, etc. That is, this process of a cell's 
being loaded, touching a neighbor, and funneling loading information to that neighbor 
may iterate. In this way a loading arm may be snaked through an array, with its 
length only limited by the size of the perfect array. This growth of a loading arm 
to any cell from any other is facilitated by the loading mechanism's mono-active, 
balanced nature. 

A brief example will illustrate loading via growth of a flexible loading 
arm. Assume a perfect array was to be loaded with function states (INO INI OUTO 
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OUTl) equal <0 1 0) for cell (0 0), (1110) for cell (0 1), (0 10 1) for cell (1 
0), and (1 10) for cell (1 1). The Array Programmer may connect to cell (0 0) 
in the manner shown In figure 3.8. After the array is turned on, all cells have 
CTR=TCH=0. The Array Programmer raises S.L.IN of cell (0 0), the base of the 
loading arm. The Array Programmer uses the C and D lines to clock out the 
sequence {0,Q,0,1|0,0} in the manner indicated in figure 3.7. This loads shift- 
register FL with (INO INI OUTO OUTl LOO L01)=(0 10 0). That Is, the 
function-specification state bits have been properly loaded and the loader bits LOO 
and LOl tell cell (0 0) to touch up. At TO, the first downward transition of CLIN 
after the loading t)f shift-register FL, S.R.0UT of cell (0 0) is raised. Cell (0 1) Is 
now ready to receive C and D information from the Array Programmer, routed 
through cell (0 0). 

The subsequent sequence clocked out of the Array Programmer via the 
C and D lines is {0,1,0,1,1,1}, {1,0,0,1,0,1}, {1,0,1,0}. Thus all the cells of the array 
have been loaded by a loading arm snaking through the array in the manner 
indicated in Figure 3.8. A different-shaped loading arm could have been used to 
accomplish an equivalent loading of function-specification state bits. 

We now consider the various options available to enhance the capability 
of the Basic Loader. The Total Retractor allows a loading arm to be grown and 
later totally retracted by a signal to the base of the loading arm. This allows, for 
Instance, reloading of cells and rerouting of a loading arm to new cells from the 
same arm base. With the Total Retractor, a perfect array is loaded just as 
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Fig. 3.7 Clocking Out The Loading Sequence 0, 0, 0, 1, 0, 
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described above. Assume the loading arm of figure 3.8 exists. If the Array 
Programmer lowers its S line, no S.IN of cell (0 0) is high; the Total Retractor of 
cell (0 0) causes that cell to be reset to CTR=TCH=0. When TCH goes low, all 
S.OUTs of ceil <0 0) go low. This resets (1 0), which resets (1 1), which resets (0 
1). The function-specification state bits of these cells are unaffected. The Total 
Retractor thus allows the resetting of all the cells in a loading arm by lowering the 
S input to the base of the arm. These reset cells are then ready to be re-loaded 
by some new loader arm. 

The Incremental Retractor allows a loading arm to be shortened cell-by- 
cell, instead of all-at-once as with the Total Retractor. The Incremental Retractor 
shown in figure 3.5 includes the Total Retractor circuit, so this Incremental 
Retractor is always used with the Total Retractor. The Incremental Retractor can 
save time when, for instance, one wants to change the state of a cell that is near 
the tip of a long loading arm. Consider the long loading arm of figure 3.9. If the 
Array Programmer wanted to reload cell <99 0), as it might on the basis of some 
test on cell (100 0), cell (100 0) could be loaded with information telling it to 
touch left. When S.LIN of cell (99 0) went high, the incremental retractor of cell 
(99 0) would create a reset pulse. This would reset cell (99 0) for subsequent 
loading from (98 0). Resetting of (99 0) would lower S.R.In of (100 0), thereby 
causing (100 0)'s Total Retractor to remove (100 0) from the loading arm while 
leaving (100 0)'s function state the same. This incremental retraction is much 
faster than the equivalent action of total retraction and subsequent growth of the 
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loader arm from (0 0) to (99 0). 

In loader realizations in which the Incremental Retractor does not Include 
Total Retractor circuitry, the Incremental Retractor may be used for a type of total 
retraction. If all the other S inputs of a cell are low, lowering its S line and then 
raising it prepares the cell to be loaded from that S line's side-set. Assume an 
Array Programmer wanted to grow a new loading arm from the base of an existing, 
unnecessary arm. By lowering the cell's high S line, then raising It, the Array 
Programmer would prepare the cell to be loaded, forcing all the cell's S.Out lines 
low. The cell could re-touch the cell It last touched, or touch some other neighbor, 
and a new arm could be grown from the old base. Of course, part of the old arm 
might remain in the array. Under certain conditions this is Intolerable; loading a 
cell in an old arm from some new side-set Involves special considerations, as we'll 
see. Nevertheless, many applications make fast retraction feasible through 
exclusive use of the Incremental Retractor. When fast retraction Is unfeasible, an 
arm may be totally retracted cell-by-cell via the Incremental Retractor, as we've 
discussed. If even this is impossible, due to a growth failure at the loading arm's 
tip, the Incremental Retractor allows a new arm to grow through the working cells 
of an old arm; subsequent incremental retraction frees these working cells from 
the loading arm. 

The Tip Changer allows a tip cell to be repetitively loaded by the same 
loading arm. This is another time-saving device, particularly helpful when one 
wants to test the same cell in various states. It involves adding an extra bit to 
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shift-register ft, and consequently requires one more clock pdse for the loacRng 
of a cell. If a ^ cell Is loaded with L$TA«1, the downward CIN transition at TO 
(directly after the P-detector goes high) causes resetting of CTR and TCH to 
CTR-TCH-a T*ie fact that LSTA is high atso prevents the ceN from toucNng ariy 
other cell. Thus the cell is loaded with a function state, remains the loading tip, 
and is therefore ready to be immediately reloaded, if the oetl is loaded with 
LSTA-0, it me^ touch another cell as if no Tip Changer existed. 

Thus the Basic Loader can combine with a con^ination of Total 
Retractor, Incrementai Retractor, and Tip Changer. The particular combination used 
In an array depends on the specific objectives for that array, to arrays designed 
for infrequent ioa^ng^ minimization of circuitry by exclusive use of one Retractor 
option might b« appropriate. The rest of this chapter details growth of shift- 
register arms, ff program-variable shift-register length was important to an array, 
variation-speed coi^dtt'ations might encoiwage use of all loader options. 

In summary, the fundamental loading mechanism allows loading inputs 
from one of several sides to control loading of a cell. The cell may learn that, and 
how, subsequent information input from its active loading side-set should be 
passed to loading outputs of some other side-set If the set of loading mechanism 
neighbors is property chosen, inputs to any cell may cause the loading of a cell 
anywhere in a perfect array, and loading of most cells in a flawed array. The 
loading miaehanism may be incorporated into arrays with diverse processing layers. 
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Section 3.2: A Perfect Array Of Shift-register Cells 

We now examine one realization of a complete shift-register cell, shown 
in Figure 3.10. We'll eventually show how an array of such cells can provide large, 
highly integrated, variable-length, automatically testable and repairable memories. 
For clarity, we begin by considering an array of such cells which contains no 
flawed cells. The shift-register cell's loading mechanism is almost identical to the 
loader of Figure 3.5. For simplicity, we assume that all the loader's options are 
included in the shift-register cell. In fact, the approach we describe can be 
adjusted to work with just a retractor optica 

Each side-set has Select, Clock, and Data loader inputs and ouptuts, 
whose function has been described. In addition, each side-set has distinct Klock, 
iNput, and Return processing input-output lines; there is one set of K.IN, K.OUT, 
N.IN, N.OUT, R.IN, and ROUT lines in each side-set. The shift-register cell could 
have been realized by disjoint loading and processing mechanisms. However, the 
cell shown in Figure 3.10 reduces circuitry and loading time by using bits in shifts- 
register FL in a dual role as function-specification and loader bits. The loader of 
Figure 3.10 corresponds to that of figure 3.5 with the following mapping: 

Figure 3.5: INO INI OUTO OUTl LOO LOl LSTA 

Figure 3.10: INO INI OUTO OUTl OUTO OUTl STA 
S-R FL is reset when power is turned on to limit processing layer complications of 
certain faulty cells. 

Figures 3.11 and 3.12 give alternate functional descriptions for the 
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Pig. 3.10 A Complete Shift-register Cell 
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Fig. 3.11 Abbreviating A Shift-register Cell's Function State 
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The FUNCTION STATE diagram indicates the important 
processing inputs and outputs for a particular function 
state. 



The short arrow in the ABBREVIATION diagram indicates 
the active Klock input. Its side-set is nearest the base 
of a shift-register arm. 
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Fig. 3.12 Shift-register Cell's Function States 
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processing outputs in various function states. It's apparent that the processing 
outputs depend only on the function-specification state bits, the processing state 
(shift-register A and shift-register B), and the processing inputs. Shift-registers A 
and B are of arbitrary length, with the particular practical length chosen by 
integration-level considerations discussed later. A cell has one relcon neighbor 
when STA=1; a Klock input from a side-set E clocks N.E.IN Information through 
shift-register A, then through shift-register B, and finally out N.E.OUT. A cell has 
two relcon neighbors when STA=0; while K.E.IN clocks shift-register A and sWft- 
register B, N.E.IN flows through shift-register A and then out some side-set F, and 
N.F.IN flows through S.R B and then out N.E.OUT. 

Cells are used to form shift-register arms. Information In an arm flows 
from the base of the arm via K and N lines to the arm's tip, turns, and flows back 
to the base via the R lines. The cell at the tip of the arm has STA=1. This ceil 
acts as a loop; it forms shift-register A and shift-register B Into one shift- 
register, with the same relcon neighbor providing K and N inputs to this shift- 
register, and receiving the Return output of this shift-register. All non-tip cells in 
the arm have STA=0. Each of these ceils receives K.IN and N.IN Information from a 
relcon neighbor nearer the arm base, and transmits R.OUT information to that cell. 
Each of these cells also outputs N.OUT and K.OUT to a relcon neighbor nearer the 
arm tip, and receives R.IN information from that cell. 

A simple example illustrates how the Basic Loader and Total Retractor 
allow the loading of more than one shift-register into a perfect array by use of the 
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Pig. 3.13 Iioading Two Shift-registers Into Perfect Array 
(Sxtra-arxay processing lines are not shown.) 
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loader inputs to one cell. Consider the perfect array of Figure 3.13.A, with all 
cells reset because power has just been turned oa The array's only connections 
with the outside world, other than power lines, are 

1) (0 0)'8 loader inputs, which connect to the Array Programmer; and 

2) (0 0>'s and (0 l)'s N.LIN, K.LIN, and RLOUT lines (not shown in the 
figure), which will provide the inputs and outputs of two shift-registers 
embeckjed in the array. 

After the Array Programmer raises S, the sequence {0,0,lil}f {0,1,1,1}, 
{0,1,1,1,}, {1,1,1,1} is clocked via C and inputs into cell (0 0). When S Is 
lowered, the loading arm totally retracts. This leaves the array in the processing 
state shown in Figure 3.13.B. S is again raised, and the sequence {0,1,1,1}, 
{0,1,1,1}, {1,1,1,1} is clocked out. S Is lowered, and the array is left in the 
processing state of Figure 3.13.C. The processing lines shown are made available 
through some type of wire. Loader lines may also be made available; so that the 
array may be repaired if it develops a flaw, or an embedded sWft-register's length 
may be varied. 

In estimating the time to load a cell's p-bit shift-register FL, we 
consider two extremes: 

1) If the cell being loaded is the base of the arm, the minimum delay 
between C.IN transitions is tmin = 1/fmax, where fmax is the maximum 
clock-frequency of a shift-register. 
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2) If the cetl being loaded i$ many cetis away from the arm*a bate, 
tmin }« deterrotned by 2 factors: 

a) After a new aiN bit has been sent to the loaded cell, 
CM cannot go high until weVe sure the D.iN bit will arrive at the 
loaded cdl before C^s new tranrition. 

b) After tNs CiN transition, O.IN cannot be changed until 
we're sure, the CIN transition witi definttely arrive at the loaded 
cell bef«'e the new D.IN. 

Thus the time to load a cdl n ceKs from the base of its loadNtg arm is the rMter 
of 2 numbers. 

tload * p X max (1/fmax, n x <dmax ••■ cmax - dmin - cmin)) 
Here dmax is the maximum delay of a signal through a cell, and the other d and c 
symbols above are similarly defined. Recall that a logic gate can have a very smell 
delay if it's known that only one of its inputs changes frequency. Noting that the 
C and D deters come solely from an AND-OR function, where ^ ANOs have only 
one input that changes fast, we observe that dmax for a cell is epproximetely 
equal to dmax for a lope gate with a load of four input-loads. 

In estimating the maximum frequency at which an embedded shift- 
register may be clocked, we make two assun^tions: 

1) All bits of shift-registers A and B of a particular cetl are clocked 

simultaneously. 
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2) A clockpulse remains a pulse as it travels down an arm. 

The rate-limiting delay then comes from the delay path schematized in Figure 3.14. 

tmin = andmax + 2 x andormax + s-rmax + s-rsetup 
Andmax is the maximum delay through an AND gate, where only one Input to the 
gate changes often. Andormax is the maximum delay through an AND-OR gate, 
where only one input to an AND gate changes oftea S-rmax is the maximum time 
between a clock transition to a shift-register and the subsequent stabilization of 
its output at its proper value. S-rsetup is the time the D input to a shift-register 
must be stable before a clock transitioa The Klock input to a shift-register arm 
must have a low enough frequency that, for the longest possible arm, a pulse 
remains a pulse as it travels down the arm; and two pulses are never less than 
tmin apart. 

The method we've shown for relaying clock signals down loader and 
shift-register arms has two major disadvantages: 

1) A clockpulse may expand or contract indefinitely If it's passed 
down a long enough arm. This limits the clock's frequency. 

2) The frequency at which Data can be sent down the loading arm is 
limited by the uncertain delay involved in sending a Clock or Data signal 
down a long arm. Ideally a Clockpulse and its associated Data bit flow 
through an arm at the same speed. 

Figure 3.15 shows a simple circuit which eliminates these difficulties. The circuit 
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Fig. 3.14 Shift-register's Rate-limiting Delay 
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Fig. 3.15 Pulsewidth Regulator With Data Transmitter Option 
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Tiie combination of the n-element delay with gate 2 constitutes a pulser 
responsible for outputting a pulse long enough to trigger a neighboring cell's flip- 
flops. The m-element tapped delay in combination with gate 1 lengthens a clock 
input pulse enough to assure that the pulser acts properly. 

Assume that the delay of a signal-transition through any gate is D plus or 
minus t. Assume that the clock input to the Pulsewidth Regulator has been long 
enough so that the outputs of all gates are 0. First consider the Pulsewidth 
Regulator with no option. Clock input receives a positive pulse of minimum length 
P sufficient to trigger any of a cell's attached memory elements. The m-element 
tapped delay is tapped at enough places that a clockpulse P long causes one 
longer pulse out gate 1. This longer pulse has a minimum width W such that 

W > [P + (m + 1) (D - t) - (D + t)] = [P + niD - (m + 2) tl . 
If U > n (D - t) , the clock output is a pulse X with 

K > nD - (n + 2)t. 
Assume that the pulse at the clock output is reduced by at most R as it travels 
through gates to the clock input of a neighboring cell. Then the following 
conditions assure that the neighboring cell receives a pulse of minimum width P. 

nD - (n + 2)t > P AND P + niD - (ni + 2) t > n(D - t) . 
Longer clock input pulses obviously work fine. 

A similar analysis calculates the maximum clock frequency. 

If the Data Transmitter option is used, the clock output pulse must be 
delayed the right amount to assure that a Data bit and its associated clockpulse 
flow together from one cell to the next. 
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assures that a clockpulse of minimum widtli P which is input to a loading arm will 
travel down the arm with a tightly-bounded width. The Data Transmitter option 
assures that a tSata bit and its associated clockpulse flow together at the same 
speed down a loading arm. Placement of the Pulsewidth Regulator before the 
broadcast Klock output of a shift-register cell would increase the maximum 
clocking frequency of a long shift-register arm. The loader's use of the Pulsewidth 
Regulator and (teta Transmitter option would speed loading for long loading arms at 
the cost of slower loading for short arms and increased cell overhead. 

Richard Shoup's method for forming an embedded shift-register is quite 
different from ours. In Shoup's method, a cell contains only 1 processing layer 
shift-register^ function state bits control which INput goes to the shift-register, 
and the output of the shift-register is broadcast to all its neighbors. Clocks to the 
shift-register cells come down the Data control lines, the same lines used for 
loading function-specification state bits. This of course means that all co-column 
cells must be clocked simultaneously; they can't, for Instance, be used for 
different regi«ters with different clock frequencies. Shoup's arrays are relatively 
hard to test, especially for large arrays. Testing requires "building up shift- 
register paths of increasing length between opposite edges of the array." (See 
<Shoup 73>.) Every cell is tested in all 4 directions; we'll see that this Is an 
unnecessarily large amount of testing. The tester accesses the processing inputs 
and outputs of all edge cells; this requires excessive use of probes and bonding 
pads. Our loading method gives our shift-register arrays many advantages. We'll 
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see that the fact that all communication with an embedded shift-register arm is 
through its base also facilitates testing. The major drawback of this bi-directional 
capability of our shift-register cell is that it slightly reduces a shift-register's 
maximum clock frequency. 
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Section 3.3: Tatl^B And Rtpatr 

In tfM« section we consider tlie concurrent processes of testing end 
repair involv«l in embedcRng a sHft-r^ster arm nwcMne in a ft«wed wtsf. Th» 
sNft-register ceH^ is the one we* ve been eonsicterini^ ttuA of fl|yr« 310. We focue 
on growth of one arm from a ban ceii with loading and processing connections to 
an Array Propwnmer, Ixit the technityjes cfisciosed are easily generalized The 
Array Progranmer uses a loading arm to grow longer and Icmf^ shift-register 
arms, lilte the hio-in Figure 3.13 The growing shift-register arm extends through 
the same ceils as the loading wm The arm is tested as it grows. Fwlure to peso 
a test indicates ^t tlw arm nHJst twist throui^ the arri^ in e slli^tly (iffweni 
way, so that it includes ^y good cells. If an atteR4>t is made to prockice an arm 
of a certain le^ii^ in a given flawed array wth ir^M^ md outputs at a pven side- 
set, several things may hiq^pea Such an arm may be realized, the nray mtif be 
f<Hind incapable of frmiucing such an wm, or testing nuiy take too Icmg. 

Embeckting an arm of balanced cells is particularly easy. The arm is 
extended ceit-lpji^-eell into an array. When an arm is in a given pMttioi\ the arm ie 
tested under the temporary assun^^tion that its non-tip cells will rerodn in their 
current function states. The relcon neighbors of each body celt are therefore 
known, and information flowing in the arm to and from its base tests the cell's 
communication with its relcon neighbors. As long as a cdl's intercmmection, mm- 
relcon neighbors aren't loaded, it's assumed that their ir^s to an arm cell don't 
change. ConsecH^ntly, it's sufficient to test an endsedded arm via the inputs end 
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outputs at the base of the arm. The cells* balance allows an arm to move in any 
direction as it snakes through good cells in an array. Sometimes extension of an 
arm in an intended direction is prevented by a flawed cell. Then the arm is 
retracted, and the arm's growth proceeds in some new direction from some stump 
of the unsuccessfully extended arm. Since the cells in the stump of the arm stay 
in the same function state, they need not be re-tested. A cell is only tested in its 
role in an embedded shift-register arm. Thus growth of an arm through balanced 
cells facilitates the Interwoven processes of testing and repair. 

Description of the testing process is much clearer if we use an example 
simplified by some assumptions: 

1) Good cells are only loaded under the Array Programmer's control, and 
not by signals caused by faulty cells. This assumption is satisfied If no 
faulty cell outputs a high S at the same side-set where it outputs an 
alternating C, since this is the only way a faulty cell can load a good cell. 
This assumption is also satisfied if any cell that is not loaded under the 
Array Programmer's control is defined as a bad cell, even if it is not the 
cell's fault that it is improperly loaded. Since we'd like properly formed 
cells to be used as good cells, we specify cell mechanisms that help 
guarantee that a good cell is not falsely loaded This involves making the 
set of valid loading commands smaller than the set of possible loading 
commands, so that fault-generated commands are likely to be disobeyed. 

2) A cell's performance depends only on that cell's mechanism, state, and 
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input stgfMftfi. It does not dtrectly depend, for Instance, on the ttate of «om« 
other cdl In the array. Like the fourth assumption, tNs saves testing tinw; 
it's used In most IC testing programs. The assumption is reasonable because 
the only lines connecting different ceits are the side-set Unes and the power 
lines. TMs assumption accounts for side-set lines, in eome technotogl«8, 
such as inagnetic bubbles, coupling could not occur through cell-connecting 
power %mrs because there are no such lirws. With other technologies, such 
as converftional semiconductor technology, it*s true that such coupling could 
occur, ttowever, the regularity of an array is useful in minimizing this 
possibility. Each ceil could contain a simple regulator circuit optimized for 
the highly predictable characteristics of a worldng cell. 

3) Cdis ^t are faulty during array testing mu^ be somewhat consistent 
in their fai^ty be^^vior; that is, 

A) a successfutiy tested ceil doesnn develop new faults during array 
testing; and 

B) ist processing input that doesn't alternate during the testing of a cell 
may not alternate during other array testing, unless the Array 
ProgrMttmer corranands it to altemata 

Assumption 3 makes it easier to localize the cause of a test failure. 
Assumption 3A is reasonable bec%ise the time to test a redizidale array is 
very short compared to its mean-time-between-failure. Assumption SB 
allows a cell in an embedded arm to be tested solely for proper 
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communication with its reicon neighbors; it allows the Array Programmer to 
assume that a cell passing its tests won't misbehave during further array 
testing due to a previously unencountered input signal. (Most ceils wouldn't 
misbehave anyway, since they're programmed to ignore irrelevant inputs.) A 
ceil may have side-sets which are inaccessible to an Array Programmer, due 
to the cell's position near a flawed cell or at an array's edge. All the cells 
of figure 3.16 have at least one inaccessible side-set Assumption 3 allows 
such a cell to be embedded in an arm in spite of the inaccessibility of its 
irrelevant side-sets. Like assumption 1, assumption 3 is valid if all the 
inputs and outputs of faulty cells are assumped to be stuck at some value. 
If assumption 3 is invalid for a particular array, the Array Programmer may 
become confused during testing. In this case the Array Programmer can 
start testing the array agaia Repeated confusion indicates that faults are 
forming at a pathologically high rate; then the Array Programmer signals that 
the array should be rejected. 

4) The behavior of certain mechanisms in a cell is independent of the 
state of other mechanisms. We assume that a shift-register bit works if it 
successfully accepts new information when the bit and its 2 neighboring bits 
are in any of their 2^ - B states. This assumption implies that a shift- 
register bit's function is independent of the state of non-adjacent bits in an 
array. This allows the testing of a length-n shift-register by testing its 
ability to shift a (10 + n)-bit sequence (000101 1 100 — ), in which 
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the n bit* are used to push the first ten bits through the sMft-reglster. 
This cofimron, reMonable assumption it necessary to save testing time; 
testing a 40-bit shift-register in every state would take a sequence of 
approxknateiy 1,000,000,030,000 bits, and we expect a cetl*t sNft-regiater 
to be considerabty longer than 40 bits. The unspecified bits In the 
sequence above could be selected to dm^n maximum current from the 
power si^ly. Similarly, we assume that the processing mechaniam*a 
behavior Js indepem^t of the loadar*» stile. TMa asttimi^ori tavea teat 
time. 

The validity of these assumptions, which are like those made in testing 
conventional cKgital systems, can be made wery likely by proper array design and 
layout. The ultimate test of the validity of these assumptions for a particular 
array is experimentation with that array. 

We now consider a testing process operatir^ under these assumptions. 
Consider the «mf of Figure 3.16, shown in successive stages of testing. The oriy 
extra-array connections are the Array Programmer's processing and loader 
connections to ceU (0 1), which aren't shown in the figtre. Here we assume the 
shift-register arm Is to be 5 cells long; m » 5. When a new cell B is to be tested 
for possible addition to a shift-register arm currently extending to its tip at cell A, 
several things happen. Cell A is put in a state so that shift-register erm 
information is routed to and from B. B Is put into a loop state - (0 0), (0 1 
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Fig. 3.16 Growth Of Perfect Shift-register Into Flawed Array 
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1), (1 1 0), or (1 1 1 1) -with the loop starting and ending at A's side-set 
shared with B. Assume N is the number of bits shifted completely through the 
processing shift-registers of cell B and passed down the arm for monitoring. Also 
assume the Array Programmer knows the contents of all the A shift-registers In 
the arm up through cell A. (The Array Programmer should know this; it's loaded 
these registers.) Testing cell B in an arm of length m then requires (N + alength + 
m X blength) Klock inputs to the arm, where alength and blength are the lengths of 
shift-register A and shift-register B. Passing the test means that the shift-register 
arm works properly; a new tip has been properly added. If cell 8 Is the last, 
''m"th cell of the arm, the Array Programmer is then satisfied that a shift-register 
arm has been properly formed in the array. (See stages 8 and 9 in the figure.) 
The Array Programmer doesn't care whether the cells of the arm could have been 
loaded from other directions, or would have worked in other function states. It 
doesn't care if some cells of the array haven't been tested at all. (See cell (0 2) 
in the figure.) The Array Programmer simply cares that its objective has been 
realized. This pragmatic approach allows substantial reduction of testing time. 

If cell B is meant to be part of a longer arm, it must be connected to an 
interconnection neighbor cell, other than A, just as A was connected to B. The 
testing of this new, longer arm then proceeds as above. Growth is a recursive 
procedure. 

Failure of the arm after its extension from cell A to cell B indicates that 
growth from cell A to cell B is impossibla A may be loading B incorrectly, B may 
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be flawed, A or B may be outputting Klock information to a neighbor elsewhere In 
the arm, etc. The Array Programmer doesn't worry about the specific nature of 
the problem. It simply uses one of two reasonable flaw-models. Ceil B may be 
judged as a flawed cell never to be tested again, as In the example. This 
simplification might be appropriate if the area of shift-registers A and B dominated 
the area of a cell; failure was probably due to a failure in this area A second 
alternative is to just consider the boundary between cells A and B impassable In 
the attempted directioa Cell B might be approached later from one of Its other 
neighbors. 

If cell B can't be approached from cell A, some new arm-path Is tried if 
there is still one to be tried. Cell A considers touching its neighbor cells in some 
established order. When a neighbor is considered for touching, the touch Is 
attempted if the cell exists (isn't out-of bounds), isn't known to be unloadable from 
A, and isn't already part of an arm. Furthermore, extension of the arm through 
that cell must, at least potentially, eventually yield an arm of the desired lengtK 
This last provision explains why no attempt is made to include cell (2 0) In the arm 
in the example; at best a length-4 arm would result. 

If all A's neighbors have been rejected, the arm is forced to try some 
new path that includes all arm cells up to A. In the example, (0 0) of stage 3 Is 
cell A. Since there's no cell compatible with the existing arm that can be loaded 
from (0 0), the arm is retracted to cell (0 1), where new paths are considered 

A program simulates the method described above for loading a shift- 
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register arm Into a flawed, rectangular irray. The unseat fault model is usec^ a 
cell 1$ either perfect or hopelessly flawed A pr(^am fwttm a flaw pattern of 
specified dimcMwlons with randomly sprinkled flaws. Another program tries to 
realize the lon|^ arm po^ble In the flawed arriy, fftmng from a spedfiec^ good 
base cell. A time limH Is used because the program wouM e^entueUy consider ali 
possible arm paths extencRng from the base cell. 

The repair program is ^tort and simple. Whwi an wrm has grown to a 
certain tip, it fries to extend itself toward the newest array edge. Thus an arm 
spirals toward ¥tm center of an nray in a pwfect wray. If rw improviftiimt in the 
maximLflnn discovered wm is made in one-fmjrth of the time limit, the progrem looks 
at adj«:ent celts that we not included In tNs longest arm wkI are not known to be 
flawed The program tries stinple jogging of the arm to include these celts. Ttte 
program retWYYs with a plctiH'e of the resisting ann In the flawed ernQTi end some 
statistics concerning the arm growth. 

Figtres 3.17 and 3.18 show arms snakir^ through two different 25 x 25 
arrays. Statists for these and other, similar experiments, wpit^tr in taUe 3.1. 
Figure 3.19 shows graphs derived from table 3.1. 

• The experiments suggest several conclusions: 

1) For %flawed under about 25, %of total drops about 2.2% when 
Xflawed increases 1%. This Is nearly indepwwient of the size of the 
array, with larger arrays doing slightly better. Repair efficiency dso 
drops steadily. For instance, for the unstarred 400-c^ amy in table 



Fig. 3.17 Result Of An Arm-growth Experiment 
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The relcon network above shows the path of an arm after an arm-growth 
experiment. One can follow the arm's path from its base, at (2 1), to its tip, at 
(15 18). There are 625 cells, 100 flawed cells, and 463 arm cells. We were able 
to repair the array to embed an arm with 495 cells. This suggests that our 
program's repair efficiency can be improved. 
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Fig. 3.18 Result Of An Arm-growth Experiment 




The base cell is (1 1). There are 625 cells, 225 flawed ceils, and 216 arm cells. 



PAGE 134 



TabliB 3.1 Results Of Arm-growth Experiaents 

(Ist of 2 pages) 

Key: 

%flaued - flawed cells as percent of all cells 
Cells - total cells in square array 
#flaw8 - total flawed cells In array 
max-arin - the longest arm our program grew 
%oftotal - cells in longest ara as percent of all cells 
timelim - the tine limit, in seconds 
Time - the time the program ran 
%of timelim - tine as percent of tinelin 
* - For two starred (or unstarred) arrays of the sane size, 
one set of flaw coordinates is a subset of the other. 

Table: 
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Table 3.1 Results Of Arm-growth Experiments 

(2nd of 2 pages) 
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Fig. 3.19 Graphs For Experiments Embedding Balanced Arms 
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3.1, tl»e repair efficiency drops from .98 at Xflawed - 5 to .415 at 
^flawed > 35. 

2) As Xflawed increases, a cutoff point is reached where Xof total 
drops precipitously. In our experiments, tfvs oMiH'ed for Xflawed 
between 25 and 45 This cutoff occurs when an array is so flawed that 
the arm is trapped Very small arrays, such as the 100-cell arrays in 
our «<periments, tend to have tower repair efficiencies and lower cutoff 
points; because a higher percentsfe of cdls are edge cells. An edge ia 
a barrier that restricts the growth of an wm. 

3) The time tal^n to embed an arm varies widely for a fixed Xflawed 
It is roug^y proportional to the nunMser of celts in an wray, wid tends to 
increase as Xflmved increases. When the cutoff point is readied, the 
time to embed an arm ptiHTvnets. TNs is an exan^e where testing and 
rep«r time is far from growing astronomically with an array*8sl2e, even 
though very few input leads connect the Array Programmer and the 
array. 

4^ if the active area of a cell is fixed, statistical considerations state 
that Xflawed varies less as slice size (and mmiber of celts) increases. 
This fact, the near-independence of Xoftotal on the nunid»er of cells in an 
array, the proportionatity of the time to test and repair an array to its 
number of ceils, and the desirability of large m«Tiories in one integrated 
circuit package all argue for fabrication of the largest possible sllcea. 
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5) How large should cells on a large slice be? Assume that the 
dominant consideration is the number of bits in the largest embedded 
shift-register arm The total number of bits in an arm embedded on a 
slice is proportional to the product of two factors: 

a) The fraction Ye of total cells embedded in the arm. Our 
experiments show that for a given cell yield Yc > 3/4, Yg Is 
approximately 1 - 2.2(1 - Yc). Yc is a technology-dependent 
function of defect density and cell area 

b) The fraction of a cell's area containing processing shift- 
registers. If a cell has P area devoted to processing shift- 
registers and V area devoted to other circuitry, this fraction Is 
P/(P + V). 

We can express this product as a function of P and technology-related 
parameters. Finding the maximum of this function via differentiation 
tells us the value of P that yields the highest expected number of bits 
in a shift-register- arm. At one extreme, a large slice has nothing but 
overhead circuitry. At the other extreme, it has one large, flawed cell. 
Note that a minimum condition is that a cell be small enough to make 
•/.flawed below the cutoff point. This condition Is now met In most 
technologies. 
Though the repair simulation program is simple. Its performance is 
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encouraging. Th&re are severd ways It can be improved In a prockiction line 
using large slices, the program would know an expected, minimum size of an 
embedded arm for a slice of a given size. To save coimputer time, it could be 
satisfied when it attained that miiuimm-sized arm, or one slightly icrger. At tMs 
point, use of much more computa^ time to mie<irraze the arm wotid proMtAy not be 
worth the cost Our «roUation ran in con^^ed U^ and no effcM*! was made to 
improve speed. A production-oriented repair program would be carefully written 
in assembly lenguage. More cdinputer time could be used to improve repair 
efficiency. 

We've repaired figure 3.17*8 array to realize an arm 495 cells long. 
Our ability to improve the repair efficiency from dSX to 94X si^estt our repair 
program's performance can also improve. A more complicated and/or heuristic 
program could improve repair efficiency. A simple extennon of oir progrwn wotjrid 
be more sophistrcatMl about jog^ng an arm to include iHuned, good cells. Even 
the current jogging procedure could be called several times, instead of only once 
at the end of ttiemain arm-growth procedure. 

Now consider our assumption that no fairity cell outputs a Ngh S at the 
same side-set where it outputs an alternating C. If a faulty cell outputs only 
constant signals, this assumption is obviously valid. However, this assunqstion is 
not valid if our assump^ons are relaxed to say that all FAULTY outputs of a cell 
are stuck outputs. In particular, it is not valid if a cell A's only fault is a high 
S.CHJT - say S.LCHJT. In tNs case incormng loading signals may be routed to the 
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falsely-touched neighbor B at the same time they are routed to the appropriate 
Array Programmer-intended cell. In this case we call cell A a branch cell, and cell B 
the branch arm's base. This branching is particularly vexing because Its effects might 
not be felt until much later in the testing. Consider the array of figure 3.20, 
whose only fault is a high S.R.In to cell (2 2). That is, (1 2) is a branch cell. In 
such an array the indicated state could occur. The Array Programmer would only 
know of the existence of the intended arm. When the intended arm tried to touch 
(0 0), the branch arm would touch (1 0), causing the subsequent test failure of the 
extended version of the intended arm that included (0 0). This failure could be 
caused by a faulty (0 0) cell, but in this case it wouldn't have been. Even if the 
Array Programmer knew the failure was due to a loading branch, it wouldn't know 
where the branch occurred; here cells (1 0), (1 1), (1 2), or (1 3) could have 
been branch cells. The problem is heightened by the fact that total retraction of 
the intended arm via lowering (1 3)'s S.U.In does not affect the branch arm. Indeed 
it may grow further if more loading information is clocked into (1 2). Happily, a 
working cell's Incremental and Total Retractor circuitry implies that attempting to 
load a good cell in a branch arm results in the freeing of all the cells from the 
loaded cell to the tip cell in the branch arm. 

There is a wide range of possible approaches to the loading branch. On 
one extreme, the Array Programmer could assume that this branching problem does 
not exist. If this assumption is invalid for a particular array, the Array Programmer 
may find itself hopelessly confused. Then it quits its testing attempts, and signals 
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Fig. 3.20 Growth Of A Branch Arm 
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that the total array should be discarded This is a fast approach that might be 
reasonable if the probability of a branch cell was low; for instance, if cells had 
many elements or arrays had few cells. 

An array can be successfully loaded even if it has a branch cell, if one Is 
willing to accept the extra testing involved. By our assumption that all faulty 
outputs are stuck at some value, a branch cell can only transmit loading information 
to a branch base if the branch cell's C output to the branch base works. Then D 
of that side-set is either: 

1) an alternating signal transferred by the branch cell, as in the 
example above; or 

2) a fixed D signal, which causes the branch base to be continually 
reset due to its being programmed into the STA<=1 state. 

(2) is no problem; it's (1) we're considering. 

The Array Programmer can use several facts to generate a list of 
possible branch cells. When (1) holds, some of the tip end of the branch arm is a 
translated version of the intended arm. This is true because the branch base 
receives the same C and D information that the branch cell receives. The Array 
Programmer can use this information, and its knowledge of the position of the 
intended arm, to generate a list of possible branch cells. Knowledge of which cell 
of the intended arm failed helps reduce the size of this list. This knowledge may 
come from noting that all cells of an intended arm from its base through some cell 
C properly transmitted their shift-register B; the cell touched by the branch arm 
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tmjchad cdl C of the celt to the tip side of cell C. 

Assume ceil A tried to toix:h cdl B, aind the stidsec^ent test Med The 
Array Programrr^er might suspect a branch ceil if not even ^ft-r6|^ster B of cell A 
((1 3) above) ot#uts property during the test A neighboring cell C, pert of a 
branch arm, may have touched celt A immwiiateiy after tt^ «nyt«ieeue iMcKng of 
cells A and C, thereby displacing arm A's tip to celt A. Celt A would thenlM Ideded 
with information intencted for cell B (see figure 3.21). 

When the Array Programmer suspects a test failure occurred (because 
of a branch ceil, it retracts the intended wrm The Array Provanmmr thm regrowe 
the arm throi^h cell A, and tries to terminate the arm vHth e tocf) at cdi C, the 
possible branch ceH digest to the former intended arm's tip. Tfris new arm is 
tested. An unsueeessfiri tMt suggests that the potmttirt iM'nich win's base, cell C, 
was the branch «^s bas», cell A, the branch celt, Is marked as totiMy flaMwd. If 
the test is successful, and th^e are other potential iM'aiKli celts closer to Um base 
of the intended arm, these cells are tested In the same way celt A was. ITiat is, 
arm A is retracted and then hooked into a potential branch ba»» cedt. TNs pn'ocess 
repeats until all iratenttiri l>ranch cdls are tested, or a branch ceM Is found. If alt 
tests are successful, Vtme was no tM'anch ceil. Testing wid repair conMntje as if 
cell A was merely unable to include cell B in tl^ shlft^regltter arm. In «iy event, 
tNs process assures tlwt no brarwh wm remains to clutter ixp ttie array, (if such 
an arm never affects intendMl arm growth, we don't care at^jt It anywiQr.) 

Note how the Incremental Retractor circuitry hcrfps in the exampto 
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above, it allows the Intended arm to touch and load a cell that's been part of a 
branch arm. <0f course, loading must be slow enough to make negligible the 
slightly different delays of C and information traveling through arms A and B.) 
Furthermore, it allows quick incremental retraction of arm A when a potential 
branch cell is found to be good. 

These branch location steps are illustrated in figure 3.22 for the array 
of figure 3.20. Incremental retraction is used between all the stages shown. 

In another type of possible branching, a branch cell transmits high 
S.OUTs to more than one cell AFTER the branch cell has been loaded. This type of 
branching, which is much less likely than the other, can be handled in a very similar 
way. 

Of course, various steps can be taken to reduce the probability of a 
branch cell. Instead of one S line for selection, a cell could have a larger set of 
such lines. Only the proper combination of inputs to these lines would cause a cell 
to accept loading information. This approach could make chance selection, and 
consequent branching, arbitrarily unlikely by sufficiently increasing the number of 
selection lines. 

The Array Programmer could send to a cell loading information stating 
ioading-input-direction, which the cell would compare to its Select inputs to decide 
whether to accept a command. This technique would also help reduce the effects 
of a branch cell by reducing the ratio of valid loading commands to total loading 
commands. These techniques, and others like them, would only be employed after 
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Pig. 3.22 Location Of A Branch Cell 
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a more thorough analysis of the probability of a branch ceil for a specific celt 
implemented In a specific technology. 

In loading more than one shift-register arm into an array, one must 
worry that a branch arm will destroy a shift-register arm that has already been 
formed and tested If this possibility is sufficiently probable, it's a good Idea to 
continue testing a completed shift-register arm while a new arm is being formed 
Effects of a branch arm can then be detected and countered before extensive 
damage to the completed arm machine occurs. Besides monitoring the integrity of 
the completed arm, this approach helps limit the confusion caused by a branch arm. 
In limiting our consideration of possible failure modes to those above, 
we are encouraged by a quote from <Von Neumann 66>: 

"The axiomatization of automata for the completely defined situation Is a 
very nice exercise for one who faces the problem for the first time, but 
everybody who has had experience with it knows that it's only a very 
preliminary stage of the problem." . . . 

"There can be no question of eliminating failures or of completely 
paralyzing the effects of failures. All we can do is to try to arrange an 
automaton so that in the vast majority of failures it can continue to 
operate." 
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Our discussion of testing and repair shows we can achieve Von 
Neumann's goal simply and efficiently by incorporating oir loacfing, testir^, and 
repair mecharnsms into a cellular array. The major timitatton of our cfiscuasion - 
the uncertainty of an aj^oiN'iate flaw model - wilt be redjced whM a particular 
technology and ccM layout are comidered for the sNft-rttgidter array. 
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Section 3.4: Production And Mariceting Considerations 

In previous sections we've considered the basic question of array 
architecture, testing, and arm growtK In this section we consider less fundamental, 
but important, points relating to specifics of production and marketing. 

Once an arm is extended slightly into an array, the arm has many 
alternate paths; the curves of figure 3.19 then apply. However, it's critical that 
the Array Programmer be able to penetrate the array via an arm base cell. If the 
Array Programmer can only access one such cell in a flawed array, there's a 
probability pflaw that that cell will be flawed, and the array will consequently be 
unloadable. One way to ameliorate this situation is to fabricate an array with 
Array Programmer-accessible bond pads to more than one cell - each a potential 
arm base. If there are m such cells, the probability that no arm can be extended 
into the array diminishes to about pflaw"*. Quick tests would establish which base 
cells worked. The Array Programmer would then use one or all of these cells as 
base cells for testing and arm growth. The base cells should probably be away 
from the edge of the array. One reason is that the edge is more subject to flaws. 
A second reason is that there are more directions for arm growth away from the 
edge. Another ameliorating solution would put a circuit on a slice that accepted 
extra-array inputs which told it which of several cell edges to logically connect to 
the slice's leads. For instance, one "cell" would replace shift-registers A and B of 
a cell by wires. This non-cellular part of a slice would be less likely to be flawed 
than a cell. 
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Ano^er important ciuestion relates to array size. How big shoulcl an 
array be? We know that all the procedures described so far work for arbitrarily 
large arrays. WeVe itfso seen many argiments for large arrays. Orw constraint on 
the size of arrays Is manufacturing capability, which is geared toward dicing a 
wafer of maximum 3" dameter into much smirfier dups. The current lOOX yield 
approach Iws Rnidted ctevelopmmt of siqiMMN't nucNnery and tedv^CNM for tfw 
realization of very large ICs. l4Dwever, Texas hstruromts (£d use a 3/2" dwiieter 
slice for dlMretionary wiring (see <SpafKlorfer 68>). We*ve also heard that 
Hughes dev^oped a 50-watt package for a 3" slice as part of the Navy*s All 
Applications Digital Computer program; unfortunately, we haven*t learned any 
details about this yet While many of Tl's and Hughes* techniques for mounting, 
packaging, coo^inib etc can prebtMy be earned evw to large celluiar arrays, that 
process may demand considerable investment. However, that process will 
inevitably occur, spurred by improvements in iC yiei^ We are not even close to 
a fimiamental linMt here. 

For ledtnologies that require power lines conne^ing many cells, 
increases in array size increase the probability of array-destroying power 
problems. The prc^M^lity of a power hm being open-circuited can be made very 
small by making the bus wide. Layout care can lower the charge of shorts 
between a power bus and a signal lin^ most such shwts wodd probdiMy not be 
catastropNc anyway. Nevertheless very large arrays should perhaps include 
protection devices in exh cell or block of cells. This circuitry could cut e shorted, 
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or even overheated, cell off from Its power source, before the malfunction blew 
the power line's fuse or sucked down the power line. The protection devices 
could be a fuse, or could be semiconductor circuitry, such as common translstor- 
SCR protection circuitry. 

In any case, the well-defined nature of the protection clrcuitry*s 
expected load would enable it to be very simple. Figure 3.23 schematizes a 
possible layout for power lines and protection circuits. 

Another power-handling approach would make a cell's supply of power 
controllable by the cell's neighbors. For Instance, any of a cell's neighbors could 
command that the cell's power supply be switched on or off. This could save 
power in an array, and reduce the danger of faulty ceils, by channeling power only 
to the cells In an embedded machine. Indeed a "power arm" could be "grown" in 
parallel with a processing arm into an initially quiescent array of cells. 

Another question relates to the size of shift-registers A and B. Having 
shift-register B longer than 1 bit helps in the monitoring of arm growth; if each 
shift-register B in an arm contains a known pattern of Os and Is, the Array 
Programmer can monitor the position of a faulty cell by noting the location of faulty 
shift-register B output. On the other hand, a longer sNft-register B demands a 
corresponding longer time to test an arm. Consequently a good length for shift- 
register B is 2 bits. Shift-register A should probably be a length consistent with 
maximum expected number of bits in a shift-register arm. 

An array yielding a maximum shift-register arm of a certain length can 
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Fig. 3.23 1^8 sible Layout Of Pcswer Lihee And Circuitry 
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be used to provide arms shorter than that length. This means an IC producer could 
customize the same array to various customer needs. An unusually flawed array 
could provide a small shift-register, and its package could be marked accordingly. 
Customers could even be given an IC with a variable-length shift-register whose 
length was controlled via a side-set's loader inputs. 

If function-specification state bits are nonvolatile, a shift-register arm 
can be loaded into an array before it's shipped to a customer. The customer has 
the option of access to loading lines, which allow him to re-program or repair an 
array. 

If the function-specification state bits are volatile, there are several 
customer-manufacturer interface options: 

1) If a customer has a computer or other appropriate digital machine, 
he has the capability for testing and programming an array. He can use 
these capabilities, and a manufacturer-supplied prograrr^ on untested or 
slightly tested (e.g., for functioning arm base cells) arrays. 

2) The customer can receive a pre-tested array and a description of 
the loading sequence required to form a specified arm in the array. 
This description could be in some non-volatile form, such as read-only 
memory, paper-tape, or paper. Loading an already-tested array is as 
easy as loading a shift-register. Power Is turned on, an S line is raised, 
and [4 x (number of cells to be loaded)] bits are clocked via C and D 
lines into the array. 
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^ A communication link, terminated by logic-interface macNnes on 
each end, could connect the manufacturer and customer. (The link might 
be a phona line or cable.) This link could be used for loading, and even 
testing and repairing, of a customer's machine by a manufacturer's or 
system house's computer. 

4) An array requiring very low power (mdi as a CMOS array) cotdd 
be flitipped around with a battery-suppty. 

In any event, a volatile array must be backed up, either by a machine 
capable of re-ioading or by a power-supply insiring preservation of the function 
state of the array. 

It's obvious that the techniques we've described for the shift-register 
arm machine apply to any arm machine. Arm machine reaiizatiorw are appropriate 
to many machines which are realized as a chain of modules, with each module 
communicating with at most two other modules, and only the mockdes at the end of 
the chain directly connected to the machine's inputs and outputs. Many one- 
dimensional ceffdsu' arrays have tNs characteristic, so they cotid be appropriately 
realized as arm macHnes in a flawed checkerboard array. The techniques for arm 
machines easily genm'aiize to the high-relcon «nd tree riMcNnes (fiscussed In the 
next two chapters. 
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CHAPTER 4: HIGH-RELCON MACHINES 

Section 4.0: Introduction 

This chapter discusses arrays embedding high-relcon machines. High- 
relcon machines have fewer restrictions on communication between their essential 
cells than arm and tree machines. In an arm machine, no cell may have more than 
two essential neighbors. In a tree machine, only one cell may actively output 
information at a given time. All the essential cells in a high-relcon machine may 
have four neighbors, and all essential cells may be actively communicating different 
information at the same time. High-relcon machines may therefore have speed and 
flexibility advantages. However, high-relcon machines are harder to test and 
repair because a cell may have up to four essential neighbors, and because 
essential neighbors in one high-relcon machine must be essential neighbors in all 
equivalent embedded machines. Powerful mechanisms - the loader, and balanced 
processing transmission states - allow test and repair of arrays embedding high- 
relcon machines. The description of a machine as an essential network facilitates 
repair by abstractly describing the machine in a repair-oriented way. 

High-relcon machines are conducive to a sequence in which the array Is 
tested, a plan for repairing the array is developed, and the array is repaired 
through proper loading of good cells. This contrasts to the interwoven processes 
of testing and repair appropriate to arm and tree machines. However, this 
chapter's methods still use a loading arm for loading cells during testing and 
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subsaquent rwpdr of tn array. Transmission linlis form test linlcs for tasting an 
array. Tliese swrn trm«mssion iliAs may wire tocetiier assentiri raNgiil3ors in a 
machine embedded in a flawed array. We detaii \ht lest and rep«r procaduraa 
that use these funcfion stot^. Experiments with repab^ procedures waVe written 
hetp us compare repair difficulttes for arm and Ngh-raicon machihas, and suggast 
ways to in^)rova tRX' r^Mlr pro^dires. 

AppiicaUoh areas most appropriate to high-reicon machinas era 
considered We present a ^n^e ceil, CeMral, which aniMas redization of the 
benefits of high^fatcon macNnas. General may ba uNid to raaUta Ntghhf paraital, 
arl!>itrary seqtMitiat machinas, within Hmits sat only by tha size of a tSanw^ array, 
its number of input^utput leads, and tha spaid of it» cihi|lioinants. Ganaral 
ambodes the meohai^sms we use to teSt, ioac^ antf repitrMgh>riieoh roacMnas. A 
General array miQr ambad a unh^arsal coii)t»^ar^a(«rtructor-r^>airar that uses tha 
test and repair procedures we describe. Geharat*s loiding mechanism may ba 
controti^ by «i extra-am^ Afray ProgriMnm^. Mkriovw', a madtine enrisaddad 
in a General ^ray may ba an Array Programmer; it can coisirol the loading 
mechanism of caHt^ Hs anvirenment via a ftineUon itita M transmits procasaing 
inputs to one sida*s loader outpirts. This amMas a maichina ambaddMl in a 
General array to te^ mK^date, md r^^r Its cirttutiU' Mi^rwvnent 

For specificity, we begin by c^tiHtlf^ the General cell. Then wa 
consider a generiri testing tuid repitr afiH^roadh for 6ttft>edtifv Ni^ralcoh macNnas, 
and compara this approiech to tha one used for arm machines. Wa diacuat 
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realization issues peculiar to high-relcon machines. A comparison of the properties 
of high-relcon machines to the properties of arm, tree, and non-array machines 
reveals applications most suited to high-relcon arrays. 



PAGE 157 

Section 4.1 The General Celt 

The General celt is amenable to realization of Nghly parallel sequential 
machines. This cell incorporates the medtanisms essential to our testing and repair 
approaches for Wgh-relcon machines. Function states for processing, transmission, 
and memorization of infwmatlon allow realization of an arbitrary sec^jentiai mecNne 
in the processing layer of an arbitrarily large checkerboard array. A Control 
function state that transmits processing inputs as loading outputs enables an 
embedded high-retcon machine to load cells in its environment Such a mecNne 
may control a teading arm and four test links to test, program, and repair Its 
cellular environment Two or more such machines may monitor and repair each 
other. 

Figure 4.1 gives symbols for the General cell's function states. Like the 
celts of the last chapter, each General cell only communicates directly with its 
neighbors or the extra-array world. There are no signal busses extending through 
a General array. We've discussed the testing and repair advantages of tNs type 
of cellular desipi The loaders of the Shift-register and General cells are identical, 
except that processing inputs can control loading outputs when a General cell is in 
the Control function state. Each of a cell's four sides has S, L, and D loader Inputs 
and outputs (as in figure 3.5), and a Processing input and output Uke the Shift- 
register cell, the General cell incorporates all the loader's options. The shift- 
register loaded by a loading arm has four function-specification state bits - FM, FO, 
Fl, and F2 - and three loader state bits - LOO, LOl, and LSTA. This sWft-register 
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Pig. 4.1 General's Function States 

Function states are shown for all values of (FM FO Fl F2) . 
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is reset when power is turned oa in all but one functim state, only loader inputs 
affect loader outputs. However, in the Control state each Processing input from 
one of three sides affects a different loader output at the right side: P.U.IN - 
S.R.OUT, P.LIN - C.R.OUT, and P.D.IN - D.R.OUT. TNs state allows a machine, 
embedded in an array as a collection of fiHKtion states, to re-program its celiidar 
environment by appropriate processing signals transferred to some cell's loader 
outputs. 

The Cross, L-tum» R-turn, and U-turn states are types of balanced, non* 
branching transmission states. Cross is a crossover; the others are bends. We*ll 
see that Cross, L-turn, and R-turn are very useful for testing and fault-evoidance; 
note their simiiartty to the shift-register cell's non-tip states (see figure 3.12). 
Cross, L-turn, and R-turn may combine to form a transmission link arm that snakes 
through an array. Such a link may act as a two-way wire bus, or sin^sly as a wire 
carrying Information in one direction. U-turn is useful in testing} note Its sIraJlarlty 
to the shift-register cell's tip states. 

State (- 1 0) is a memory state. In tNs state, FM is not used in its 
customary function-specification state bit role; instead It's a processing layer 
P.R.IN-selectable D flip-flop. A Reset input for this flip-flop is not provided, but 
this function is easily simulated by proper manipulation of P.R.IN and P.D.IN. This 
memory state is very convenient for realization of rei^ters, adcb'estable read- 
write memories, and other common memoty modules. 

The states associated with F2 - 1 allow convenient realization of a 
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Pig. 4.2 A Function Performed In Different Orientations 

(first of 2 pages) 

Function F: out= (a + c) (a + "b") (b + c") 

Some busses between opposite sides are not shown. 

A) Array A has inputs and output at its left. 
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Fig. 4.2 A Function Performed In Different Orientations 

(second of 2 pages) 

Function F; out=(a + c) (a + S) (E + c) 



C) A rotated version of Array A, aided by U-turns, 
performs F with its inputs and outf)uts above. 
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D) A rotated version of Array A, aided by U-turns, 
performs F with its inputs and outputs below. 
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combinational logic function expressed, for instance, as a minimunj product of sums 
or sum of products. Figure 4.2 shows that these states function and combine very 
much like the states in programmable logic arrays. These General states, coupled 
with U-turn, were designed to eliminate the severe waste of cells that often 
results from cell designs that only operate on signals coming from a given, 
preferred direction. Those designs demand the use of many cells to turn an Input 
signal into an appropriate orientation. Figure 4.2 presents sample realizations of a 
logic function, and indicates the ease with which General arrays operate on signals 
to or from various directions. This is particularly important for functions with many 
input-output lines. 

The fact that digital machines usually require extensive signal-routing 
explains the cell's emphasis on bussing signals from one side to an opposite side. 
This allows a cell to perform bussing operations at some output while 
simultaneously performing a branch, combinational logic, or memory function at 
another output. 

It's easy to see that arbitrarily large, properly programmed General 
arrays can perform any time-independent, effectively computable computatioa It's 
been demonstrated that today's general-purpose computers can perform such a 
computation if their memory capacity is unlimited (see <Minsky 67>). Like <Bank8 
71>, we therefore need only show the ability to realize an extensible general- 
purpose computer in the General array. The ability to realize a general-purpose 
computer comes from the availability of its basic components - Nand gates, wires, 
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and memory i^mertts. Extensibttrty comet from the Control state and reading 
mechanism. A-n array-embedded coroptiter can ba conttri^ed to control the 
processing inpiUs, and consecjuently the fight sl^*i loader outputs, of a Control 
ceil on a right ^le of the computer's peHphery. WeVe seen that appropriate 
loader signals 1o an arbitrarily cell aHow the growtfi of a loading arm to an 
arbitrary cell in a parfect array. The array-if^baiiid computer can conMqilMitly 
send signals to ^i&rease its memory as needed. 

Since sucii a mad>lne has a moveable construction arm, It can construct 
arbitrary digital machines in an arbitrarily large array. For instance, It cen 
construct a copy of itself. It is therefore also a universal constructor. 

Well see ttirt, f or arri^ faults of a ceftalh assumed nature, an Array 
Programmer can test an array and embed a perfect machine in a flawed array. 
Since the Array ProgramWer can be reirfized In a flawed array, the General cell 
allows universal repair for faults of an assumed nature. 

Thus the General array can support a universal computer-constructor- 
repairer. 

General is universal, but simple. A processing mechanism's complexity 

results in advantages and dlsadvanti^es whose Important^ depends on the cell's 

use. The need for a low proportion of flawed cells In an array embedding Wgh- 

relcon machines currently requires that only simple cells be fabricated on a slice 

containing many cells. Basic, universal cells allow an embedded mechkie's designer 

to exploit the paralleliwn in a given algflrtthm. Testing, repair, and signal-routing 
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require cells to assume transmission states; using a very complicated ceil in such 
a simple state wastes most of its complicated mechanism. On the other hand, a 
simpler cell has a smaller ratio of processing circuitry to loading circuitry; the 
simpler cell suffers from a higher associated overhead when the loading circuitry is 
quiescent When a cell's simplicity requires more cells for a given machine, the 
function-selection in each cell slows the machine. 

One component of a cell's complexity is its number of processing lines. 
If a cell has many processing lines in a side-set, routing each of the lines to or 
from a different part of an array requires many cells to break the tines from the 
side-set's bundle of lines. Furthermore, unless independence of different parts of 
a cell's processing mechanism is assumed, test time per cell rises exponentially 
with its number of processing inputs. 

An array designer considers these general considerations and specific 
design goals when designing a high-relcon array. 

General's processing mechanism is one consistent with efficient 
implementation of our testing, repair, and computation goals. The Cross, L-turn, R- 
turn, and U-turn states are important components of test arms and transmission 
links in testing and repair. Although General cells perform wiring operations in 
many states, signal-routing is so important that expanding General's signal-routing 
capabilities might be worthwhile. Some variation of the Control state is necessary 
for realization of our goal of array-embedded array manipulators. The sequential 
machines we envision for General would use enough memory to support a memory 
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state; construction of memory elements from gates would require a greater 
proportion of ceils in an array than is justified by the resultant simplification of a 
ceil. Indeed, actual applications nvght argue for more memory elements in a cell 
and/or more memory-oriented function states. Ifs true that cells with (Fl F2) - 
(0 1) are rotated versions of cells with (F 1 F2) - <1 1), and that cell states can be 
eliminated by clever use of the (0 1 1) cell. Again, these cell simpiificatione 
would probably result in disproportionate numbers of celts for most applications. 

We briefly digress to give a little information about a familiar machine, a 
miniprocessor, unique only because we designed it as a macNne embedded In a 
General array, and because a special feature allows It to test and repair its 
cellular environment TNs mirtiprocessor could be the processor of a universal 
computer-constructor-repairer. This digression is intended to give some specific 
information about our cellular realization of a machine like one many readers are 
familiar with; those who aren*t will not lose continuity by jumping to the next 
section. We don't think the General cell is particularly suited to realization of 
conventional f^ocessors, because processors are already mass-produced IQs. 
However, we do want to demonstrate the General cell's power. Furthermore, this 
design gives some insight into the number of cells of various types needed to 
implement a somewhat farroliar machine. 

The miniprocessor we designed is a 16-bit pw'allel, syncN'onous, single- 
sequence machine with conventional A-B-C bus structure Rgure 4.3 gives a map 
of the miniprocessor. The machine has 66 extra-«'ray lines: 1 dock, 1 interrupt, 
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Fig. 4.3 Map Of Miniprocessor-Tester-Repairer 



8 Data 
inputs & 
outputs 



t 



Interrupt 



Clock 



8 2-%*ay tcansmission 

links 



1^ 



-> 12' 



TIMING & 

CONTROL 

SECTION 



4 



i 



loading arm's base 



!S 



J 



n 



4 test-link 
bases 



I I 
I I 



1^ 



ALU/REGISTERS SECTION 



^ 



> I 



1 



31 



15 



^ 



I I 
I I 



:i5' 
I I 



16 

MEMORY 
INTERFACE 
SECTION 



^ 



Wrxte 
Memory 



16 Inputs From Memory 

15 Outputs For Memory Address 

16 Outputs For Mfflttory Data 



PAGE 167 

8 data inputs, 8 data outputs, 15 memory address, 1 "write memory", 16 memory 
data inputs, and 16 memory data ot^put lines. The machine also hat four test 
links, and one loader arm for testing and r^ptki we ditcuss use of these easily 
implemented features In later sections. The machine^s main sections are e Timing 
and Control section, a Memory Interface section, and an Arithmetic-Logic 
Unit/Registers section. Both the Memory Interface and ALU/Registers sections 
have 16 similar modules, one for each t>lt-s«ce. The Memory Interface Section 
contains the 14-bit Instruction register, and many transmission links. The 
ALU/Registers section contains six large (1$ or 16-bit) registers; these are the 
Accumulator, Program Counter, Instruction, Subroutine Return, Interrupt Return, and 
Input-Output/Test & Load registers TWs section's 16 blocks are Identical, except 
that the block Interfacing with the Timing and Control Section is slightly different 
The miniprocessor has fairly conv^itiond aritlvnetic, log^d, sutx'outine, interrupt, 
and Input-output capabilities. Instructidns are processed in a conver^onal, single- 
sequence way. 

We specified this machine as one embedded in a perfect, rectangular 
General array wHh about 9,000 cells. Its non-wrlUng indirect memory reference 
instruction takes three cycles, with about 700 cell-delays for each cyde. Since 
most cells introduce about one gate-delay, a cycle takes about seven microseconds 
for a technology with a gate-delay of 10 nanoseconds. Each rectangular 
ALU/Register slice pves an example of a mix of cell types; each has 18 unused 
cells, 147 transmission cells, 53 combinational logic cells, and 6 memory cells. Each 
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bit-slice has 88 essential ceils: 53 combinational lo|^ ctils, 6 memory cells, 16 U- 
turns, and 13 branches. There are 118 non-brancNi^ trwisn^sslon cells used as 
wires. Other parts of our processor-tester-repairer had an even Ngher ratio of 
wire cells to essential cells. This emphasizes the importance of good signal-routing 
capabilities in high-relcon arrays. 

Testing and repair techniques usii^ the (general cell depend only on the 
loader and processing transmission states, so the tesUr^ and repwr i^roach for 
General can be applied to other high-relcon arrays with loader and processing 
transmission ci^ialMiities analogous to General*s. 
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Section 4.2: introduction To Testing, Constructimi, And Repair 

Testing, configuration, and repair for high-relcon maciiines is similar to 
those processes for balanced arm machines, although there are important 
differences. The chief cfifferences are that hlgh-releon macNnet are not conduciv« 
to the interwoven processes of test and repair; and test and repair are more 
difficult and less efficient for high-relcon machines. We consider an approach 
applicable to any high-relcon checkerboard array with our loading arm and 
transmission link facHities. We mention how a Control state like General's may ba 
tested, but this state is not essential to our testli^ and rt^r (^tproach. 

In considering embedding an arm in an array, we made certain 
reasonable assumptions concerning failure modes of the array. Then the 
interwoven processes of testing and repair were considered. These processes 
occurred by the (radual snaking of an arm into an euray. A cell was tested only 
insofar as necessary to establish its successful incorporation into a desired arm; 
this usually meant a cell wasn't tested in all of its states. Testing of a new arm- 
tip cell required u^ng a partially tested ceil, but this presented no difficulty. 

In considering embedding high-relcon machines, we make assumptions 
very close to those made in the last chapter. However, most Ngh-relcon macNnes 
are poorly suited to gradual growth and testing for two main reasons: 

1) In growing an arm, the mmnber of relevant extra-arn^ processing 

inputs and outputs remains fixed However, high-relcon machines 

usually have a variable, sometimes large mffnber of relevant side-sets 
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as they're growa Most generaiiyi this requiret test arim Unking a test 
machine to the relevant sida-sets at a partially grown machine's 
periphery. TWs requires an Array Propwimer to have a variable and 
often large number of test arms and associated links. We*d much prefer 
to have a low, fixed number of such Imks. Consequent, we test ceils 
individuatiy, relying on independence assufiptions about ceiis' behavior. 

2) Embedding an arm in a flawed »-ray can be done efficiently by 
gradual growth of the arm, followed by local )ogpng of the arm to 

■ 

include clumps of good cells. High^raicon machines benefit greatly from 
a global repair approach that begins with a descripjtion of ail the flaws 
in an array. This means that repair efficl^nqf Is improved by separation 
of the test and repair procedires. 

These considerations explain why the test an4 rapair procasaas for 
high-relcon macNnes are segmented into a sarieaof seyara) distinct procedures. 

First the Array Programmer's Ttst procedure tests an array, noting the 
location of faulty ceils. This testing Is independent of the essential machine that ia 
eventually embedded in the array, so Test's reaults are valid until an array 
develops a new flaw. 

A Repair procedure determines how to embed a parff^t machine In the 
faulty array. Repair accepts a flaw pattern description of a flawed array from 
Test Repair also accepts an essential network model of the desired essential 
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machine. Repairs output is a description of the repaired arr^ that places each of 
an arrays cetis into one of the fottdwii^ fotir ci^ei^es: 

1) The cett is flawed 

2) The ceH is an essential cell. 

3) The oell may assume an arbitrary non-Control function stat& None of 
its outputs is relevant to the embeddid fMcNne*s output 

4) The cell is in a Gross, L-tum, or l^tam transmls^on state. The cell 
is part of one or more wires associated with relevent Inputs and 
outputs of essential cells. 

The Cwwrnwr procedure constructs a perfect macNne In a flawed array. 
Construct modifies Repair's output by mapping each of an essential machine's 
essential cell states into a properly located essential cell. Repair has arranged 
that essential cells be wired together In the proper way. Construct accepts from 
Test a model of ^e flawed array stating which side-sita may definitely be used 
for loading. Test develops this model as it tests an array. Every cell that Test 
finds to be geod has some side-set that can be used for loading the cell. 
Construct only activates the side-sets specified by Test as It extends a loading 
arm into an array. Construct's loading arm may touch ary good celt, but it irfways 
touches and loads essential cells {category 2) and wire cells (category 4). When 
Construct completes its loading task, a perfect macWne is embedded In the array. 
The embedded machine is ready for further test or use. 

Our high-relcon repair procedure assumes that the length of wires 
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between essential cells is irrelevant to the proper functioning of an embedded 
machine. Possible techniques for assuring the validity of this assumption are 
suggested at the end of this chapter. 
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Section 4.3: Testing 

Testing an array embedding a liigh-relcon machine involves the one-by- 
one testing of the ceils in that array via test links between the tested cell and an 
Array Programmer. This procedure is relatively difficult, compared to testing of an 
array embedding an arm, because Test doesn't know how Repair will map a perfect 
machine into the faulty array. This implies that most cdls tmist be tested in all 
their function states. Because all of a cell's accessible processing inputs and 
outputs may affect an embedded macNne's output, Test must vary the accessible 
processing inputs to the cell, and monitor the accessible processing outputs. 
Consequently testing a ceil usually involves linking each accessible side-set with 
the Array Programmer via a test link. Figure 4.4 shows that the procetaing 
transmission states ere ideirfly siited for tNs task. 

Test makes the assumptions listed below. Each assumption is andogoua 
to the correspond!!^ assumption made for sNft-register cdls. 

1) Good cell's are only loaded under Test's control, or because of a 
branch^ ceil, and not by signals caused by faulty cells. 

2) A cell's performance depends only on that cell's mechanism, state, 
and input sigrmls. 

3) A successfully tested cell does not develop a fault before the 
Construct process is over. 

4) A cell's processing outputs don't depend on its loader state; and, 
unless the function state is the Control state, loader performance 
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Fig. 4.4 Test Links To Processiag Lines Of Tested Cell 
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doe»nU depend on the function state. This non-essentid, reasonable 
independence assumption atlows a reckictton in testing time. Teat need 
not, for instisme, test a fuiction state iw alt load«' states. 

In considering testing, we first foci» on the iesi stages that occir when 
all tests are passed We then address implications of test falltres, end possible 
flaw models. The modtiting question Is pursyed In the aubtequent detcriptton of 
Repair. 

Testing a cell requires explicit tests of its perntfs«bl« fwction states, 
and concurrent Implicit tests of Its loader. Tests of a typical cell Involve two 
types of communication between the Array Programmer and the cells at the teat 
site. Test links connect the Array Programmer to the processing inputs and 
outputs at the test site, as In figure 4.4. The test links are composed only of cdla 
in the Cross, L-turn, or R-turn transmission states. The Array Programmer 
requires one test link to each accessible side-set. The Array Programmer 
communicates to a tested cell through signals to and from the base of each test 
link. Besides the test links, a loading arm extending to the tested region links the 
Array Programmer with loader inputs. This arm may pass through cells that are 
also in a test link, or even through the tested cell. (However, the Array 
Programmer should not relay Ngh processing signals down a test link connected to 
the up, left, or down side-set of a cell being loaded, and temporarily in the Control 
function state.) The Array Programmer may change the state of cells, such as the 
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tested cell, either by sending signals into the base of the loading arm, or by 
sending processing signals down three test links that converge on a Control cell. 

Testing a cell's non-Control function states involves cycling it through 
those function states the cell may assume in an embedded machine. For each such 
state, appropriate stimulus signals, and responses to these signals, flow through 
the test links. We'll see that Repair always specifies that a good cell adjacent to 
a hopelessly flawed cell assume a Cross, L-turn, or R-turn state; this Is an 
example of the tested function states being a subset of the set of all non-Control 
function states. In this case, only some of the tested cell*8 side-sets are 
accessible. A functional test of a non-Control, non-Memory function state involves 
at most 2^ = 16 input combinations. Fewer input combinations may be appropriate 
if some side-sets are inaccessible, or if independence of certain outputs and 
certain inputs is validly assumed. For instance, the left Processing Input might be 
experimentally found to never affect the right Processing output in the U-turn 
state, even for a faulty cell; this would allow simplified testing of the U-turn 
state. 

Testing a cell's response time in a given state is possible, if the Array 
Programmer can accurately time a test link's output response to an Input. 
Differential techniques then allow the calculation of the delay associated with each 
test link. Additional delay comes from delay through the tested cell. 
Unfortunately, accurate timing requires time resolution of less than one gate-delay, 
which is difficult to achieve. 
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If the Array R'ogrammer knows the dei^ through each teat link, teat 
time for a given function state deponds on how qac\^ the Array Programmer can 
change the Input to a test link. This is iimited by the Array Programmer's speed or 
the bandwidth of a cell. Ar^ inaccuracy in the estimate of the delay through a link 
may also limit test speed by effectively reducing the t»andwidth of the Hnk. 

In testing a Control function state, test links connect the Array 
Programmer to aji four of the skie-sets of the cell in the Control state Rrst the 
Array Programmer verifies that the right side-set*s lest link is not a test arm, by 
ascertaining that a si^ into the base of the test link doesnn return to the base 
after an appropriate delay. Then signals into the up, left, and down processing 
inputs command the tested cell to load the cell to Its right into a U-turn state. 
The Array Programme' again tests the right test link. If it's now a test arm, the 
Control state is good; otherwise the Control state is bad. 

Testing e cell's loacSng machanism is implicit in the tests of the cell's 
permissible function states. If a cell fails its function tests. Construct doesn't try 
to load it. If a cell passes its function tests, a loading arm has successfully loaded 
the ceil and retracted from the cell. Therefore Construct't loading arm can also 
load the cell from some side-set Test keeps a mi^ of which side-sets the loader 
uses to successfully activate and de-activate woridng cells. Construct uaea tNa 
map to determine the path of its loading arm. 

After a ceti has been tested, test links must be moved to a new test 
site, if there is any remaining. The new test site is usually a cell adjacent to the 
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last tested cell. Thus only the tip ends of the loading arm and test links need be 
moved. This is fairly simple, since a loading tip is at the test site. Each test link 
is gradually extended as part of a test arm, just as arms were extended in chapter 
3. After each incremental extension, all links are tested to assure growth is 
proceeding satisfactorally. Since a test arm only Incorporates cells in transmission 
states, faulty cells are discovered and avoided as In chapter 3. This gradual 
extension is particularly appropriate in an array with a high fault density. In an 
array with a very low fault density, the speedup from non-gradual growth could 
offset the slowdown from a faulty cell's confusion factor. 

The process of moving the test site terminates with each of the new 
test cell's accessible side-sets connected to a test link. The new test cell, in the 
U-turn state, is the tip of one or more test arms. The test process is repeated 
for this cell. 

In the last chapter we noted that failure after an incremental arm 
extension could mean several things. For instance, the new tip cell might be 
hopelessly flawed, or It might just be incapable of receiving information from the 
indicated direction. We noted that various flaw models might be appropriate, 
depending on the cell layout and the sophistication of the Array Programmer. 

This modelling difficulty again rises with the high-relcon array. Growth 
of test links is analogous to growth of shift-register arms, so the same comments 
apply. A similar difficulty arises when a cell is in the process of being fully tested 
The cell may produce nonsense in all states; modelling that cell as hopelessly 
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Saiction 4.4: Ftopwr 

The R«j3«r iM'oceclu'e deteriniiM»9 how tp mht^.tt |>^«ct» high>rdccm 
machine in a fiawed array. Test p.aii9| Rflfip^; a Umi pattern description of the 
flawed array. Most generdiy, Repar m^^ m^>ed the Itt'gest |^d mecMnfJt can. 
Construct may then construct in the ftawi^ arr|y„any inachine with an essentia 
networic that fits into this largest grid We consiokN' fl3d~emb«dcfing first Most 
actual embedded macNnes have essential qeits. with Jrr«tevant side-sett} their 
essential networks are grids with squires and (i(i)(s rnttiing. The roost jenerai 
Repair method is then less efficient than a nMith^ w|h^ np.ticet an incon^ete 
grid We eventually consider such a less g«wrai,.roore<effkier\t BfPai'' procedure. 
Its main drawback comes when a new machin* .rnupt be JHpiedded in the fiawed 
array; if the new macNne's essent^ netwerit ivrfi a ^jboatworK of the original 
essential network, new rfpair of the arri^ is^ffei^iry. ^|^r jdapidas hfw to 
locate and wire tqifther |f|pd ess<NitiiM c^|% uf||f^^|,||^ trfvp^smwi 

states as wlras, Jo ariM3td a paffi^cVi^acWnein a,|||*ii)^ arri^ X^a lyiowa 
Construct to assocpte the proper function state with aech^«antial cell, and to 
wire If^gether eits^tiai celli» with transmission: 9tateft4fi6tatMl^y Repair. 

The Repair procedures that wir have wri^ j|p«iM thf aimpjesl fault 
modej: a cell is either go<»di or hopelesf^ f^^ad. ^a4^i^|i^yaUM« «^ssi«fipti(m is 
most queftiQn^la, because of its harshne^ji^l4ilb%(if a#ide N^tfMt cells 
A and B is impasseltie. Ttus concfition can bMS^ss^ Q;||M|a:iby sfyini, ^t #ither 
(;ell A or cell B is hopelessly flawed RepfAr kofiop t|M^ «n iJnttfewed ceil fhouid 
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not allow a faulty cairs ouput to affect an ombaddad madii»yi*s output thua no 
output-aff^ctrnf tinir WW IM ^at^tti^ h all othw caaM 

Contii^ two ^h«d(arboard arraya virtth tha ai^ dltiribuiion of good 
and bad colli, and consttquantly the aalia fHlw pattern, tli* firat array ia for 
erMMddInf a r^ raaclNlM>; and the aeiifkl Is Welibeiii m Wm roicHlrie. Since 
there are many ways an irni can wTnd Oroui^ all thi eaientiat nodea of any of the 
gHd*8 reicort netwerkt, the lohgeat embedded arm In the aecond &rr^ conteina et 
least as many essential celts as the largest grid embe^ld^ in the first errey. 
Embedding a grid in a flawed array invefves usThg'some good cells purely ea links 
between issehtlil neighbors. Cells in the corre^oridifig pd^tfisri iri m aecond 
array can be ueed as arnr^s; because ceOil in liriks fta^o releoh es high ea cella 
in arms. Thus o^MiuRt re^reffielincy for t^ il at leaat eslgh ea 

^tlmum repair i^lilisncy for the rid iMiehitH |viim iMlNiW pltterh. 

i^sw^ the OfHimum efflefeneiei^tbitii:^ 
from a non-expiHliiental, purely tniithma^ perspective appears viry difficult 
An analytic, tractabte ebcpresslon f<r optimum tl^lkfiili^, given e perticuler 
flaw pattern, appears Impossible for mort ca^es. Ah ejtpresslon for iVerege 
•ffidency, averaged o¥«r # flaw dttMbutldns for a gh^ nmm^nwi9d ceita 
In art array of a certain ^ze, irfso appears i#c«lsible ttw both tirm end grids. 
Although tm iirtight flhd ioirnelower boundTfor i^piriffti^en^ Ifa Rlid^ M the 
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bounds would not be close enough to the optimum to be practically Interesting. 
Furthermore, one would still have little knowledge of the difficulty of attaining or 
surpassing a lower bound in an actual Repair procedure. 

Consequently, our approach has been to write promising repair 
procedures, observe their behavior, and use our observations to suggest 
improvements in the procedures. Some of these suggestions are implemented, and 
the process repeats. 

Many actual essential machines contain a mixture of low-relcon and 
high-relcon essential cells. Figure 4.5 gives the relcon network for our embedding 
of one bit-slice of the ALU/register section of our processor-tester-repairer in a 
perfect array. The upper-right region of the bit-slice has many high-relcon 
essential cells, and has few links to nodes outside the region. On the other hand, 
the bit-slice has many relcon-2 chains, balanced arms, and even relcon-O cdls. 
Many relcon-2 and relcon-4 ceils are used as a wire or crossover. 

In embedding the bit-slice in a flawed array, we could approximate its 
essential network by a grid. Adding constraints to Repair in this way would have 
three major effects: 

1) It would simplify the description of the slice's essential network. 

2) It would make Repair's results valid for any machine that fit Into a 
perfect 7 x 32 array. 

4) It would diminish Repair's efficiency. 
In this section, we first consider grid-embedding - the most difficult, general repair 



■::'-"'--f?^i^j-':^.-r^»;-'- 



PAGE 183 



Rg. 4.|^Rfl^Jips#/|j|rttFo^^ 




In the rectangular bit-slice there are 224 total cells: 18 refcon-0 cetit, 16 
relcon-1 cells, 78 relcon-2 cells, 28 relcon-3 cells, anci.i4^«f^:^^.(^il|«. The 
bit-slice's relcon network represents a compact embtckfug of a mMWiw with 88 
essential qeils: 53 <»mblc^Upoal jm^c c^llj^ J af^^ U-tum cdls. and 

13 b»*iinth clrtll. lis of the ti«c6P2 li^ i'^wi^ <^^ wm nen-brenching 
transmission ceils, which arsi used as wires. Qthpr parts fl| QMr.procMter-tMtw- 
rep«fer hid ah^iyihh)iNrt«ti<r^l^^ i^^yifi^fl^^; 
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in a checHer|g)a^ inray. VVf coinpiirft^f%«^%|f|p|^ W« 

then suggest an approach which improves en^jeddii^ effiilipyn):^, noticing nussing 
iinlts between no^et in a N|^-r^on jnp^uf'f ess#r^ ftetwork. This type of 
approach is the nK»st feiis^e for |(K>s^ 

It's uf^aiiy GH(ficult fo o^tMy «ni|i#3a ^ ^ji §91^ m^m tf the 
array has very few flavtfs, Gzt4, Rtpatr M^9^]b^c§^^gm:^^ rewfiHiabie 

wa^|i to interconnoQt good ceils t^forp, a iar^ gi^ As 1^ iH^ber of fteM^s in 
the array irx:mases, the lujo^er of r^NHS^e wi^ tfil^m^i^J^ai^frid ai>q:!^bode8. 
Repair cannot consider aU possible, emb^Ndbeiing^^lH^ liOMld t^9 to« much 
corapgtatioa The obvioMs, sif^ rtpwir iMtN^P^ Wipi^ irrays 

don't worit well. Eventually th^e ar« so many litiwf i» fii erriiy i|i«t the 
embedding probtemis <»asy, becaustU^ ^v^ thft^^io r^caR bft wnbedcied in 
th« arrety. 

We im» on the most dif ficMti ^iflii^fitm^f^-'M^^'J^^ ^A^ present 
a reasonable approach which % considwsais^ iQflf«Sfi;^ti^ 
simiJar approach wi^vii j»een» whijph is i^uiMfwja^ r^Pir o|;i^ 
arrays. 

The nucleus of Grid Repair is a JI«|^ M§i^ fX(if|^M|i- TMt fH'oiiMiure, 
which we'll detail, is very efficient at enpls^cldii^ ,gr|<|8 in mpderatety large 
rectangular arrays of flawed cdjs. Anottpr pr«cfchf^ ^/^clo^, iiccepts as inputs: 
1) an essential network for a madune ||f|i|^^ 

This network is describeclj^|nb|rcopected,fiiptpgu^ 
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^ • fliw liittim fir i flwMi i^ 

W« twnptiririty attuttM thit f^:^ ii^ dM^dw th* location of a 
flawed array's ifi|MC^ai#(tt itriM^ «w aiiliiathiytlili i^l^^ aftir an array is 
repalrafL W#N saa lliat ^SHIbkeff k aa# ifiomMHff^'^Miki^ the location of 
inM-oi:i^ HnM. StaeN»rf ptrmoHs «ia ftiifid^^i^ ^tetaifiulir Modes 
saparatad t»y fnlareowia^ioji %tH|H^ aieh IHddt ti Mlindad to tibl^i grid, 
etoekof f 4liiii^iics Tiiritt Raf»air to ilManiMlii fiotio ^ a piipw-^initf gr^ into 
each of tha Ma^. ff ^^m^i^ tttm^pwitm mWHwiomixf thi biodcs, 
Bl<«!l«»fl fits. mitimmrmodimfiMm ilfm^Hrfmm^^ 
rid lii#(s axtendhit from Md) t)N)dL If tt^tyceaieli, IKlidwff pasAsi ttM Piaiilting 
dasertption of tha I'apairwi arr«y io CUii^ttct if l^fcioff t^UMeem^ tha 
grids, it asics Twist Rapair for an altamata ambaddmg for at iaast ena Istodc in 
ra-rapaiHngany l»i«cl^ t>ii^ Itepilr^M^^ tha point of 

its last suaeass. tln^ praeass itaritas^untir IM^icibiadi iriitt. 

Rt^r 48 artentad Umif^ ractangtf ar liiit #r savaraf raasons. First, 
tNs is tiw most natural, tractak>la struetura in a ciwcicarkMard array. Saeend, Um 
Ganaral call ft auliwr to f«etangyiar macNnit. F1Ha%, lif li^iad^^ madiina 
can ba viawad as « i^m|loiita e^ r^tanglas of vaHays sheas. 

Wa first datii Twist Rsfpifr, and than filackoff. Wa axamina thair 
response to actual amt>i(ddlng proiiiams^ conipira thair pai^oi^manca to Arm Rapair, 
and not» IMr litfiita«oni. Wa ahM soisHit^i^^ of tha Rapair 
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procedures we've written. 

The simplest, most obvious way to embed a grid in a flawed array only 
uses a cell as an essential node in the grid if the cell's row and column contain no 
flawed cells. A good cell in a flawed line - a row or column - enters the Cross 
state, so it interconnects essential neighbors. We'll call this repair technique 
Stmple Repair. Simple repair of checkerboard arrays is analogous to Kukreja*8 
repair of cutpoint-connected arrays. 

Note that Simple Repair is the best possible grid-embedding repair 
when an array has few flawed cells. If an array has only one flawed cell, an 
embedded grid must have at least one less row and one less column than the 
flawed array; the flawed cell's row and column are bottlenecks. 

Unfortunately, this Simple Repair is very inefficient as the number of 
flaws in an array increases. For such an array, we'd like an approach that is able 
to twist a grid's lines through an array, so that some cells in flawed lines can still 
be used as essential cells. The L-turn and R-turn, cooperating with the Cross, are 
ideal for this purpose. Because of the way repaired blocks must interface, we 
assume a grid's lines must extend from one side of a block to its opposite side. 

The Twist Repair approach, which includes Simple Repair, uses 
horizontal and vertical adfustment lines extending completely through a flawed array 
(see figure 4.6). Any flaw on an adjustment line must be at the junction of a 
horizontal and vertical adjustment line. Adjustment lines break the array into boxes 
- rectangular regions of cells. At most one flawed cell is allowed in each box. If a 
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Fig. 4.6 Flawed 15x20 Array Twist-Repaired Into A Perfect 10x14 Array 
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Explanation: 

The retcon network above iixiicates the states of good cells and flawed 
cells in a grid-repaired array. Flawed c^ls are incKcat<^ by an X Good, imused 
celts in an arbttr«yttiAeai^ indicated fc^*.ta<i(xl«^ iM'e estmtiirf cells In 
the grid are indicated by 4-. Other cells are i^ted to interconnect essential grid 
celts. The L-turrt sitte is 1n<»cat«l by v^ ^, or ^di^fitfng bn the context 
Similarly, the R-turn state is indicated by 'V, '^', or '/-, and the Cross state is 
Indicated by -^ or t. Mote thai josuNig a wire ri^uNis tt^'uts of «^ liMt two L- 
turn or R-turn states. 
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line of boxes is free of faults, adjacent boxes in the line interconnect across 
adjustment lines via Cross cells. If a box is in a row (or column) of boxes, some of 
which contain flawed cells, one row of the box Is not used for essential cells. If 
the box contains a flaw, the flaw's row is the row with no essential cells; all 
unflawed cells in that row of the box assume the Cross state. If a box is in a row 
of boxes with flaws, and the box contains no flaw, an arbitrary row may be put 
into the Cross state. Thus all the boxes in a row have the same number R of rows 
useable as rows of essential cells. Cross, L-turn, and R-turn states are used In 
adjustment lines between boxes in a row to yield R embedded grid rows extending 
through all the row's boxes. 

Several considerations make the Twist Repair approach a reasonable 
one. Because exhaustive consideration of all repair possibilities is computationally 
excesssive, a reasonable, heuristic approach is necessary. Simple Repair is 
Inadequate for most arrays with more than a few flaws. Twist Repair recognizes 
the equivalence of many specific embeddings. For instance, an adjustment line that 
doesn't include any flawed cell may occupy any line of cells between two flawed 
cells; ail such lines are equivalent. Recognition of equivalence limits computational 
difficulty. Furthermore, this allows Blocltoff more flexibility in Interconnecting 
blocks repaired by Twist Repair. We found that forcing L-turn and R-turn links 
onto adjustment lines results in far less repair confusion and inefficiency than less 
restrained use of these states. Consider snaking an embedded grid's row through 
a flawed array of unbalanced cells, such as a General array. The only possible 
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essential ceils in the snalcing path through tlw flawed array are those cells which 
the path links to cells on the same row in the flawed array. This suggests that 
jogging of the Kne and movement of the iirw in the vertical direction shodd be 
limited Twist Repair often uses all the side-sets of cells in the L-turn and R-turn 
states; tNs efficiency helps nNniminrize the rAjmber of cells used as repair links. 
Twist Repair also attempts to place essentiirf rasif^^KH's close to each other in a 
flawed array. TNs is helpful for two reasons. First, since wires between •stential 
neighbors are useless as essential cells, it's important to minimize the nwnber of 
ceils in ead) wire. Second, an embedded machine's maxitmm speed is limited by 
delays through wires; intended processing is only done at essential cells. (^ 
ultimate justification for Twist Repair is that it is better than any other methods 
weVe considered for repairing small rectm^m' arrays to embed grids. 

The Twist Repair program's inputs are a flaw pattern and a request for 
a minimum acceptable number of grid rows and columns. As In the arm 
experiments, a square array's flaws are randonriy generated. Starting with a good 
gue#8 of where to draw adjustment lines. Twist IRepair considers alternative 
adjustment line placements exhaustively - ignoring equivalent placements - imtil it 
succeeds. Table 4.1 Is analogous to a table given for balanced arms, showing the 
best square grid Twist Repair embedd^ in experiments varying the m^i^r and 
distribution of flaws in the square array. 

Figure 4.7 shows curves based on the information in the table. The 
curves show the average of %of total for a given Xftawed, for various array sizes. 
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Table 4.1 Results Of THist-Repair Grld^MpbaWing gxparliisnta 

(Ist of 2 pageis) 

Key: Xflaued - flaued cells as percent of all cells 
cells - total cells in square array 
flaus - total flaued cells In array 
max-grld - the largest square grid our prograM enbedded 
5iof total - wax-grid as percent of qbIIs 
timelim - time limit, in seconds, 
t i me - the t i me the program ran 
Xoftimelim - time as percent of timelim 
* - For two starred (or unstarred) arrays of the same size, 
one set of flau coordinates is a subset of the other. 

Table: 

Xf I awed eel Is flaws wax-grid Xoftotal timelim time XoltlmaMni 

e lee e lea lee lea .ees e 

225 225 100 225 .005 8 

400 400 100 400 .005 8 

625 625 100 825 .005 8 

1.6 625 10 324 52 625 26 4 

« 1.6 625 10 324 52 625 95 15 

2 100 2 84 84 108 .11 8 

* 2 100 2 64 64 100 .85 8 
2 400 8 225 56 400 2.6 1 

» 2 400 8 225 56 400 5 1 

2.22 225 5 121 54 225 .17 8 

* 2.22 225 5 144 64 225 .16 8 
3.2 625 20 196 31 625 117 19 

« 3.2 625 20 Answer not found in timalim 

4 100 4 49 49 108 .87 8 

« 4 100 4 38 36 100 .03 8 

4 400 16 81 20 400 263 66 

* 4 400 16 100 25 400 231 58 
4.44 225 10 64 28 225 7.1 3 

* 4.44 225 10 64 28 225 2.4 1 
4.8 625 30 38 6 625 198 32 

« 4.8 625 30 Answer not found in timelim 

6 100 6 25 25 100 .88 1 

* 6 100 6 25 25 100 .85 1 
6 400 24 Answer not found in timelim 

» 6 400 24 64 16 400 24 6 

6.4 625 40 Answer not found in timet im 

* 6.4 625 40 16 3 625 233 37 
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Table 4.1 Results Of Tutst-Repair Grid-embedding Experiaiente 

(2nd of 2 pages) 
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Fig. 4.7 Graphs For Twist Repair Experiments 
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The smooth, consistent nature of these curves suggMts the eonciiisions iitted 
below: 

1) For a given array size, Xof total drops with increases in Xflawed 
This drop tends to be greatest for small Xflawed, miictor as Xfltwed 
increases, and non-existent after Xoftotal reaches 0. 

Consider the curve of %oftotal as a function of Xflawed, for a 
given square array. Let E be the number of cells in a line of the array. 
The first flaw introduced into the array forces %oftotal to drop from 
100 to <J00{E-1)2>/E2, wNle Xflawed increases from to 100/E* Thus 
the slope of the curve is 1-2E for Xflawed near 0. TNs e}q9lain8 why 
Xoftotal drops faster for larger arrays in this re^oa 

Consider an array with several flaws. Introduction of a new 
flaw may not cause a decrease in Xoftotal. For Instance, the flaw may 
fall at the intersection of two idjustment lines, or in a box where a flaw 
had been assumed (to allow the box to interface with adjacent flawed 
boxes, as discussed earlier). Over the set of all flaw distributions for 
an array, the probability that a new flaw wilt not cause a daeraata In 
Xoftotal tends to increase with the number of flaws In the array. At 
worst, a new flaw will eliminate one row and one column of the former 
repaired array. If the former repaired array is smaller than the original 
array, i.a., if tha repaired array hat any iimi, at worst tha ntw flaw 
decreases Xoftotal less than previous "worst possible" flaws. These 
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considerations help explain the fact that Xoftotai drops less rapidly as 
%flawed increases. 

2) Xoftotai drops faster with Xflawed fw larger arrays, because a 
given Xflawed implies a Ngher percentage of flawed lines for a larger 
array. 

We've already analyzed tNs situation for Xflawed near 0. We 
found the negative slope of Xoftotai versus Xflawed was directly 
proportional to a square array's side length, E As Xflawed increases, 
the particular distribution of flaws inftuencet Xoftotai. Howevtr, lt*8 
easy to see why Xoftotai tends to be smaller for larger arrays, for a 
given Xflawed. 

Given a fixed Xflawed, large arrays tend to have a higher 
percentage of flawed lines: the number of flaws Is proportional to the 
area, but the number of lines is proportional to the square root of the 
area. Consider two arrays, one with E-10 and one with E-lOO, at 
Xflawed = 1. For E=10, the one flaw implies Xoftotal»81. For E-lOO, 
the best possible distribution of 100 flaws puts each at one of the 100 
nodes associated with 10 horizontal and 10 vertical adjustment lines. 
Xoftotai is then 81. Most other distrilxrtions reqUre the Jogging of grid 
lines, and Xoftotai is then usually significantly smaller than 81. One 
extreme occurs in the uniiicely event that all 100 flaws occupy the same 
row or coiuma The array is effectively cut, so Xoftotal«0. 
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A more generd perspective provides a strong argument that 
moves in the direction of a proof. The flaw distributions in two arrays 
are equivalent if there's a one-to-one nmpfAng between the flaws in the 
two arrays such that the following is true, if an arbitrary flaw in one 
array has a certain relative position with respect to the other flaws in 
that array, the corresponding flaw in the second array has the same 
relative position with respect to correspomling flaws in the second 
surraf. If one of a flaw's coordinates is X, then the relative position, 
with respect to that coordinate, of a flaw whose corresponding 
coordinate is Y depends on which of the five following, mutually 
exclusive, collectively exhaustive statements is true: X4-1<Y, X4-1-Y, 
X.y, X=Y+1, X>Y+1. 

Now consider two square arrays with different sizes, but 
equivalent flaw distributions. The first array has E rows, n flaws, and 
E-F grid rows in an optimally embedded square grid. As Xflawed has 
climbed ^om to lOOn/E^ Xoftotal has dropped from 100 to 100(E- 
F)2/El The second, larger array has K.E rows. Since Twist Repair 
notices Its equivalent flaw distribution, the second array's optimum 
square grid has K.E-F grid rows. Here Xoftotal has climbed from to 
(K.R2 as Xoftotal has dropped from 100 to 100(K.E-F)V(K.E)2 The ratio 
of the change in %of total to the change in Xfiawed is (2E.F-F2)/n for the 
first array, and (2K.E.F-F^/n for the second array; an equivalent flaw 
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distribution is more costly in tl>e iarfir wtiv. Ary flaw distribution in 
an array has a corraspondrni, aquivaiant distribution in a larger array. 
However, the fact that not all flaw distribution* in an array have an 
equivalent distribution in a smaller array precludes simply extension of 
our reasoning to a proof that, for larger arri^s, Xoftotal drops faster as 
^flawed increases. It might be possible to make such a proof by 
defining some sort of loosely equivalent flaw dittributions. 

3) For a given array size, Xoftotal drops frwn 100 to fairly smootNy 
as Xftawed increases from to a number N dependent on array size and 
specific flaw distributioa (For oMr experiments, 11.2 s N i 17.7a) TNs 
contrasts with growth of arms, where Xttawed decraaeea gradually and 
smoothly until it reaches a point where it pluromets, usually for Xflawed 
approximately equal to 28. 

4) Repair efficiency is much smaller for grids than for arnris. 

5) For arrays with more than a few (approximately five) flaws, Twist 
Repair is far superior to Simple Repair. For instance, in the unstarred 
array with 625 total cells and 20 flawed cells, Twist Repair embedded a 
14X14 square grid. Simple Repair embedded a 4 X 4 square grid for 
the same array. 

8) The time to repair an array varies widely, even for a constant 
array-size and Xflawed. The ratio of the time to repair an array to the 
number of cells in the array tends to be Wgher for larger a''rays. For a 
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particyar array, the tima for repair is relatively low when there are 
very few flaws. As flaws are introduced, repair time tends to climb 
gradually, reach a pealc, and then detcend rapidly. TMs i« because 
repair time is roughly proportional to the mHitber of non-equivalent 
adjustment line {i^acements. If an array has very few flaws, there are 
few non-equivalent adjustment lines. As flaws are introduced, the 
number of non-equivalent adjustment tines increases. Eventually an 
array becomes so crowded with flaws that it*s difficult to find an 
adjustment line that doesnU include a flaw. If an adjustment line 
contains more than one flaw, several associated lines are reqUred to 
satisfy the constraint that every flaw on an adjustment line be at the 
intersection of a horizontal and vertical adjustment line. TNs reduces 
the number of non-equivalent Kijustment lines iw very flawed arrays. 

Experiments with Twist Repair suggest a new grid-embedding strategy. 
We notice that for a given Xfiawed, Xoftotal tench to be uiistMftirily Ngher and 
%oftimelim significantly tower for smaller arrays. TNs difference becomes more 
significant as Xfiawed increases, until Xfiawed is so Isrge that all grid-endaedding 
attempts are futile. This suggests that emi>edding a grid in a large array shoiid be 
done by bre?k ''«[ the array into bloci(S of optimum size, separated by 
interconnection strips. Each biocl( is repaired via Twist Repair, and its grid 
outputs are connected ecross the interconnection strips to the grid outputs of its 
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neighboring blocks. In fact, experiments show that such a procedure is superior to 
Twist Repair for large arrays with many flaws. 

A block's optimum size is determined by a tradeoff. Decreasing block- 
size tends to increase %ototal within each block, but it also decreases the total 
area devoted to blocks by increasing the number of interconnection strips. If 
%flawed is 0, it's pointless to waste any cells on interconnection strips; there 
should be one maximum-sized block. As Jiflawed increases, the optimum block-size 
decreases. Assume that the overriding factor in embedding success is Xoftotal In 
each block. For large enough arrays, the fraction of cells used in blocks, given 
each block has E cells in a line, is about (E/E+l)^ This number is 100/121 for 
E=10, and 400/441 for E=20. Using the curves of figure 4.7, this indicates that 
E«10 is superior to E=20 for '/.flawed greater than about 1.5, given our 
assumptioa This indicates how the curves and the value of (E/E+l)^ may be used 
to suggest an optimum block-size for a given '/.flawed. Experiments with Blockoff 
have confirmed that there is a fairly predictable, optimum block-size for a given 
flaw density. This fact of an optimum block-size suggests Improved grid- 
embedding can come from breaking an array into blocks whose approximately equal 
size is determined by the array's flaw density. Then the simplest approach assigns 
identical sub-grids to all blocks of the same size. This approach is limited when 
some blocks have a disproportionately high number of flaws. This situation often 
arises with current IC slices, where flaws tend to cluster. Since a very flawed 
block can only contain a small grid, that block is unable to link up with all the grid 
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outputs of its ndghboHng, less Hwmd biocks. This Rmitt tlw number of grfchrowt 
in its row of bioclcs. 

Before considering how Grid Repair should handle blocking off an array 
containing flaw dustm's, It's useful to examine the repair problem more generally. 
tt*8 quite clear that a heuristic approach Is necessary if Repair is to efficiently 
repair arrays with many flaws. Twist Repair is time-consuming, especlaliy when 
one wants to place a near-largest grid into a flawed array. We*d therefore like to 
be able to determine a priori the feasibiiity of a certain repair, In terms of 
computational difficulty and probability of success. This is particularly true if 
Blockoff is used to interconnect many blocks. Assume Blockoff operates on m 
blocks, and there eu'e g„ satisfactory, non-equivalent sub-grids that can be 
embedded in block m. Let P be the product of g,^ as n varies from 1 to m. There 
are P conrfainations of sub-grids wWch Blockoff may try to interconnect to form an 
embedded grid. If a high percentage of these P combinations are consistent with 
the desired grid, Blockoff may quickly succeed At the other extreme, Blockoff 
would spend a time proportional to P in vainly considering each of the 
combinations. 

Happily, Repair may use a rather simple heuristic approach to reduce 
repair time. Let F be a success function wWch estimates the grid-size that Repair 

can reasonably expect to embed in a given array. Most simply, F is a function of a 

V 
square array's size and its flaw density. F can be refinedln various ways we'll 

consider. For instance, an input to F could state the probability P« estimate is not 



PAGE 200 

an over-estimate. If non-square grids and blocks are considered, F can depend on 
their specified shapes. It's reasonable to obtain F experimentally, because of the 
monotonic nature of F. For instance, we've observed that Fs output decreases as 
an array's dimension or flaw density increases. This monotonicity enables us to 
estimate F by experimentally determining some of its key values, and interpolating 
to find its other values. F is Repair's heuristic guide. 

Now consider the following procedure adapted to embedding a grid 
containing R rows and C columns in an array that may contain flaw-clusters. Repair 
uses F to break the flawed array into approximately equal blocks whose size 
depends on the array's dimensions and average flaw density. F suggests the block 
size that is expected to yield the maximum embedded grid. Repair then considers 
each line of blocks, associating with each line a number equal to the number of 
lines F associates with the most flawed block in the line. Thus Repair recognizes 
the difficulty of snaking a grid's rows or columns through a cluster of faulty cells. 
Repair finds the sum S of all the numbers associated with the row lines. If S < R, 
embedding the specified grid will be difficult or impossible; Repair's action 
depends on whether it's willing to spend a lot of computation on what is probably 
a vain effort. (This decision can be made implicit if a success-probability 
parameter, like the one we've discussed, is passed to F.) If S - K.R, where K Is 
greater than or equal to 1, Repair multiplies each row number by about 1/K; so 
that all the row numbers sum to R. An analogous procedure is applied to the 
columns of blocks. If Repair decides to call Blockoff, Repair has heuristically 
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cl«t«rmineti Uw ctzd df the tdbHpid ntiliMd to tach block. thiJ» F fidlitatM • 
heurbtie for iptiMint iMiNft^ a givin rt^Mrtl lOiiSiliiiii^ if oi, tiHi itiiMnMit 
givon to^Oekoff. 

WoN^ fiotii tHot grid^ombfddifi If 
a chOfikeHsfoiNt n^iy. Jtm nta^ f^iliiUk'^i»p^iii»MtW'% ^iir procedure 
goared toward rtd-ombeddifiglf fir liMi^ if ^^r fiMlMit C^truct • 
fiawed Mrray 8|»klMid at a iar^ gi%^da^itrijet may in iha fiawtd array 

any macftino wHoao •ttofitiai riatwort i» a tuJ»-n«twor^ of ift« grid'a (Btaential 
network. Hdw^ivir; thia ganeraHt^ db«NiNM tfie efNciency y^ wMch a non-grid 
macNne fa iniMdetttf in « fiawed airi^ iVa^ aaan iKIa^W ttw axfa^eme mm of 
arm->enii>0<klnjt. 

Tlia 8io^(rff i^ocadura weVe writlm it i^NM^al avMn^ to accept an 
esiehtfat nOtwork canMning rec^Nfiiilir •ub-grkfo witti apeclRe^ tsetween 

adjacent sub-gHdii N*t easy 10 «ee w^ not^i% lifiiited comniunication patlia 
between a R^ei«ne*ii litgh^roteon reg^ JiNWiM WfiBiiriM l^^keff d0eratea 
under fewer c^buhita. 

Table At tnd figires 4.8 and 4.19^ «jn»iMrfie a Mries of experlmenta 
that begins to e>^riore iiow btock-iSxe Ind iilaii« gHd lyka affedt embeddbig Ngh- 
relcon rnaciiines. Tabfe 42 mimnMH'izet flii diti from tiie ikpeHflnenUi, and fiigure 
48 andA9 give BNKlMf^produMdp^^lhit# 

The expwrlRianta aH used t^a^^ S, WMk figure 47*s curvea 
suggest i» a reglorf whe»#i»tO(ik^«tee ott^m is better than btodt^siie of 20k20. 
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Table 4.2 E>^erimenU With Three I)^^«r4nl #flck9id^pp^ 



Blockoff operated on arrays with 5% flawed colls. For a given flawed 
array, at m^t the«i 4^l«c«nt 9m^ mm r mmi ikMmm f^/brformed. tn each 
expenment, Bloekoff was asked to place the tarot-tixed square grid in each of an 
array's approximately equal-sized btecks. Btoetofjri lest result is showm for eech 
entry. Unless otherwise' noted, Blockoff fourid^i^ i|n|Q«stble to achieve a better 
result, ^iven the eonitraihts. In 10-coniie|trSRMi6fffHecated 10 x iO blocks for 
each gnd, and tried td connect the^mSl imA 
Blockoff allocated 10 x 10 Nppks H h 
grids. In 20-cohnwit», SlOckgif f lillbcat__^ . 
intercormected bljDck^ffids into one JMii^ frid 
repaired 20 x 20 and 40 x 40 arrays ' 



J«'ge fl'id. In 10-nocoraiect. 
' Tntwconneci the email 
blocks for eech grid, and 
it figures Mntein the 



Array Expert went 



18xie 
20x28 
20x20 
20x20 

•f 48x40 
40x40 
40x40 

« 80x88 
80x80 

! 88x88 



10-connect 
10-ccmnect 
10-noc(Mnnept 
28-conr»ct 

10-conn«ct 

10-noc(tfme6t 

20-connect 

10-connect 

10-noconnect 

2d*cortnee# 



Best BtocKof f Relul t 


' Tine (seconds) 


1 GxG grid 


2.3 


4 4x4 grids - 8x8 


47.0 


4 4x4 grids 


9.2 


; 8x3^ijd 


63.1 


16 a^a ricts « I2)d2 


58.7 


IB 4x4 grids 


32.2 


4 5x5 grids - 10x18 


832. 


B4 2x2 grids - 16x16 


963. 


64 3x3 grids 


188. 



•i- When asked to put 16 4x4 grids in this. ftawAcLacray, Blockoff was still thinking 
after 45 minutes. Then wttJnterfupted and tenainetiaQ Bloekoff. 



* When asked to put 64^3 grids in this flawed airay, Blockoff was still tNnking 
after 27 minutes. Tiwr^ we interrupted and termlnKtid ao^off. 

! When asked to put 16 3x3 or 4x4 pldk in tNt ««lved array, Blockoff wet eUll 
thinking after 22 tninutis and 9.S mimftes, reijp^^. Hien we ifrterrupted and 
terminated Blockoff. 
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A) 10-connect embwit four 4m4 irt t i f c o n nae tid gricte 



1 


1 • * ■ 

i ■ ■ ■ 


■ ■ ■ 
■ ■ 




_ ^ i •, ■ • ■ 
• ■••? • • " 

L 11 ■ • 


' ' 


fe ■ ■ 
■ • • 

, *^Y 


^ ' 










«-*y«^ 


.; f - -. , 




^LiJ^ 




:i.Z::z-: 




•^" A 


^r 








M 




V^ v*^ 


'•••>< 








-13.: 




X 


■ 1 

f X- • 

• ■ 1 


"•X" 


.X; 




-, 


■ 
1 1 


1 •'■■ • :■' k 

. • 4 
. jt.i 




' 






• * ^T 4- 




1 


lx-« 


* ■ •■ f 


'^•^■» a 


y*Xf 


■i ■ 



B) 10-noconnect embeds four 4x4 uncmmeeled fHdt 
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Fig. 4.8 Biockoffs Repair Of 20x20 Array With 5% Flaw«d Calls 

(2nd of 2 pages) 



C) 20>connact embeds one 8x8 grid 
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Fig. 4.9 Blockoff 8 Repair Of A 40x40 Array With S% Flawed Cells 

(Ist of 4 p£^e$) 



A) 10-connect embeds sixteen 3x3 intercormected grids 
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Fig. 4.9 Blockoff's Repair Of 40x40 Array With 5% Flawed Cells 

(2nd of 4 pages) 



B) 10-noconnect embeds sixteen 4x4 unconnected grids 




Fig. 4.9 Biockoff's Repair Of 40x40 Array With 5% Flawed Cells 

(3rd of 4 pages) 
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C) 20-c&nnect embeds four 5x5 interconnected grids 
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Fig. 4.9 Blockoff's Repair of 40x40 Array With 5% Flawed Cells 

(4th of 4 pages) 



0) Only one link between adjacent 4x4 grids 
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Indeed, 20-connect always took substantially longer to repair an array than did 
lO-connect. Furthermore, 20-connect never embedded a larger machine than 10- 
connect, and sometimes embedded a smaller machine. 10-noconnect always 
embedded at least as many essential nodes as the others, because 10-noconnect 
works on a grid with links missing. We commanded Blockoff to place the same 
square grid in each of an array's blocks, because we didnH want to help Blockoff 
by implicitly telling it the location of flaw clusters. This constraint on Blockoff 
limited its performance; this explains why we can see ways to snake extra grid 
rows and columns through the flawed arrays. Figure 4.8.A indicates that the 
lower-right block of the 20 x 20 flawed array limited the performance of 10- 
connect and 10-noconnect. Similarly, figure 4.9.B shows that certain very flawed 
blocks limited Blockoff's performance. This argues for use of the success heuristic 
suggested earlier. Figure 4.9 indicates that Blockoff's performance diminished as 
more links were introduced between sub-grids. 

Comparing the graphs for Twist Repair experiments with table 4.2 
shows Blockoff's superiority to Twist Repair as a flawed array's size Increases. 
For %flawed equal 5, Twist Repair achieved a Xoftotal of 6 for a 25 x 25 array. 
This indicates that for 40 x 40 and 80 x 80 arrays. Twist Repair would have 
achieved %oftotal substantially under 6. For '/.flawed equal 5, Blockoff used 10- 
connect to achieve a %oftotal of 9 for a 40 x 40 array, and %oftotal greater than 
or equal to 4 for a 80 x 80 array. This and other comparisons we've made of 
Blockoff and Twist Repair indicate Blockoff is superior when ^.flawed remains 
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constant as an array's siztlncri^affs. 

We wish wa could oHar morf fxppriini||,|«i,|f «ults froiii Repair. 
However, Repair's tarMCO(nput|tifU^tiin|di^^ iix^irimentt 

unfeasible. 

Wf noyv r^-cor^der t|>ff^^»ve(^^,R^^ 
is oriented tow^d embeddiiis e rnacNrp abstcfpt^|f;b^^it)ed as |fit^:GiH¥l^ 
rectangular 8ub-|^i<^. Repair may use | »i^%appr|f|«fb;^^ ctecl^Xfhiitpfrl of 
a flawed array shoMld accept eaqtt sub-jrici, Af^ 1^#K "^^^n *f i^^f^^n^* 
Repair calls BIpckoft Pociwff inay pi a htMilfllc^^^^^^ to depM^ b^ to 

embed each sub-|rid If the 8u)?-fri<^.is SHflJli^entt)!; jsfwill, Twist Repair Is 
appropriate. QthervKtse Blockoff mi|y breaf( the t(%||l4^b)^ IHfck^ «icL prpMnt 
R(|pair with each M^-|ri4 That Is, F^r raiy b# ficu^y*. In ai^ caf% M^ray 
Is eventually broken ipto blocks repaired by Twwi i•ppir^ imd ii^i^c«iB|«(^fd by 
Blockoff. 

We've purposely Ifnored dif*M««iM^m.f flil!i»dd*4 maj^liijne'a 
interconnectiorif to ptbfr roachifies» eitlier in or put of j^irti^, Ti^a#offji ff^ating 
to tNs question are antlpgous to those for «rn^efii»|ddi|r«. Sp^off mpy be easily 
adapted to accepting inputs describing jwhkti of the ce|ls at a rmohine'a per^^hery 
carry the macNne's inputs and outputs. Ho^ng thif Jf liKai h|KieRing the iottrf ece 
between Iink9<i sijb-grids. In each case, ^pif^ar (;i|l.(lorfinstano<^ «ne^^^^ • 
lead to the extra-array world) should coonit^t to |pa||q^ cell 

One can envision further leyeli pi l^ip^t|0|^i«^p|ytipci^ whi^ la 
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increased embedcKng efficiency. Choice df io^ttt^Ubfifevel depends oh the 
character of BX^^^d repair pro^htif ^rlhitim^e, Very large arrays might 
benefit by intel^iiilcitiion stripsf \«rder '^iih ^ tyif 1^tlv6en large blocks. If 
substantial secti«N)s of an essentiid roacNne wen iiMy to have esstffitiitf t^li with 
few essential neighbors (as the General |)roceiksof>tester-repairer did), 
efficiencies would result from special handling of these sections. Indeed, perfect 
machines should probably be dei^gned Ih a modular faslMOfi, with retativ^ few 
cornTnunication pathift betweon fttodulet. thfe neUd to Hmit inter-mbdule 
comrnunieation pM? Is ilN'eady rkoin^!^ In the d^r^ of tonVentienai systems. 

We briefly sketch a promi^ng repair techhi^ for such high-relcon 
machines. An essential network is categorized In thelbtfowing way. Each 
^sentlal node with three or four ossential neighbors fiB ass6eiat#d with some 
rectangular ktgn-utcon block In a compifct Blocko1f-t«mp»tlbto way that's been 
discussed. Those wires and essential cells with <me or two essw^i^ m^i^dMrs 
that are not in a high-reicon block are associated with WrWcon blocks (see figure 
4.10). A strarlftt hoHtontal or vertical line ^t^ ah mkv^ network patees 
through at teast one hlgh-relcon or low*reloo« Mock, * that bidtk which the 
Success Heuristic F estimates as least eiffyentiyrepali'id, given the flawed 
array's average flaw ddMslty, da^iHwhef 1^ in^ should be 

allocate for an essential netWc^k line, f^r ihstatit^ lh«^ ^^>tetad embedding 
efficiency for ^« lar|« hi|h-reli»w bl«H( tBctitte 
(Alumni devoted tof perfect array coiumhs through ^ Tha relatively high 
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Fig. 4.1 1 Blocking Off A High>r«lcon Essential Nttwork 
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embeciding efficiency of transmission links dictates a lower multipie of flawed array 
columns devoted to colimm 6. Thus R^air uses Uie sunless heuristic to esUrmte 
whether a repair will succeed, and to fet e rough estimate of how to allocate 
flawed array space. Repair may then adjust lit Initial estimates by considering the 
actual number of flaws in each allocated block. Repair then uses Blockoff to repair 
the high-relcon blocks. Given a success here, Repair caUt a procedure devoted to 
aH the low-refcon ifsentM ceifs and wires that ariri*f'iA i Mg^raiicdn block. The 
main point of tNs f4>proach is to careiMly assoii^Me 0(i«Mmt1al cells with flawed 
array regions. Th# small amount of time 49vetfd jb h«M^^ car# guards a^ntt 

It 

vauH •xhaustive^.^uMWHiUiKinatty-expeosive Mkmp^i^ BtoekeH lo repair an 
array blocked off in an imrepdrat^eway. 

If Repair uses details of a macNne*s essential network to increete 
embedding effidwwy, Repair neeete a deso'iption of tluit network. In the least 
sopNsticated case, a dissigner codd specify that network to Repi^n weVe done 
this in our experiments with Blockoff. However, it is fairly easy to write a 
procedure which abstracts a maciyne's essentia rwtwork from its description as mi 
embedded m«:Nne. The procedure "works back" fr«n the enriseckied medNne*s 
outputs to find the essential cells and wires of the macMne. The resulting 
essential network coirid be blocked off by an^ysis of the location of Ng^relcoffi 
regions. Straight lines through the network that yidded a low den«ty of links 
would indicate reasonaUe boundwies between Mf^-rel^m rei^om. 

WeVe cKs^Msed an effective RojMir procedure, md actually writtan 



PAGE 214 

and analyzed fundamental components of this procedure. Nevertheless, repair of 
high-relcon machines remains a largely unexplored area. Some programs we've 
sketched remain to be implemented. Perhaps better methods of repair can be 
found More experiments would enable a better understanding of the heuristic 
success-function's nature. Interesting theoretical questions remaia Consider the 
curve of the expected width of a square embedded grid versus the width of a 
square flawed array, for some low, non-zero flaw density. Is there a repair 
procedure such that this curve is monotonically increasing? Is there a repair 
procedure such that the curve is above some positive-sloped straight line for very 
large arrays? Can you produce such a procedure, or prove there isn't one? This 
is an important question, because its answer tells us the expected size limits on 
grid machines embedded in arrays of a given flaw density. This helps us determine 
the expected size limits of high-relcon machines that aren't grids. 
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Section 4.5: Construct 

Construct accepts three infm'mation inputs wNdi cSciate how Construct 
loads cells in a flawed array. These inputs are: 

1) a description of an essential mst^ne stating essential states and 
associated wiring 

2) a dascription of a repaired array, in wNch each cell's processing 
layer function Is in one of the four categories we*ve mentioned - 
flawed, essential cell, {articular non^brMtehkig transmlMlwi state, or 
unused good cell in n-liritrary state; and 

3) a description of the repaired array stating the side^eett 
8uc<»ssfuily activated and d6-a(^vi^1)y llMfs toade^ 

WeVe noted that Construct's precise nature depends on Repair's 
generality. In any case, Construct is very simple. First Construct "mentally" map* 
a machine's essential cells into a rep«red array's essential nodes. Then Construct 
extends a loading ar^ into the flawed array, possibly toucNng all good ceils and at 
least touching and properiy setting all the cells acting as essential ceils or wires 
between essential ceils. The loading arm's base may be ar^ cell with access to 
the cells that must be set For instance, any of the celts of an embedded machine 
would be an acceptable base. Setting the proper ceHs is even easier than growing 
a long arm into an array. Construct icnows the location of flawed cells, and may 
extend, retract, or niove its arm through side-sets Test suecessfuiiy activated in 
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Fig. 4.10 Result Of An Experiment Showing Construct's Capabilty 
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Construct was asked to touch all unflawed cells, using a flexible loading 
arm with its base at (1 1). The picture shows the state of Construct's arm when 
Construct completed its task, after 65 seconds. All ^ cells are cells that have 
been touched by Construct's loading arm, but are no longer part of that arm. 
Construct finished in 65 seconds for this 25 x 35 array with. 50 flaws. 
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any way consistent with toucliing all the proper cells. Figure 4.11 shows the 
result of a simulation demonstrating Constructs ability to perform its task. The 
simulating procedure moved its arm in a flawed array. Ail cells were initially in 
either the X (flawed) or G (good) state. For simplicity, it was assumed that all 
accessible side-sets of good cells could be successfully activated and de- 
activated. The arnr. moved around in the array until ail touchable cells were 
touched. (This is doing more than is necessary.) The figure shows the state of the 
loading arm when Conistruct succeeded Of course, Construct could completely plan 
its loading strategy via such a simirtation b^ore actually extiancing its arm into an 
array. 
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Section 4.6: Other Considerations In Realizing High-relcon Machines 

We've considered the basic issues of testing, construction, and repair 
for high-relcon machines in previous sectons. Now we turn to less fundamental, 
but important, aspects of high-relcon machines. We considered production and 
marketing issues for arm machines in section 3.4. We suggested ways to satisfy 
constraints imposed by the need for adequate array-access ports (chapter 3 called 
them "arm bases"), the need for proper handling of shared power lines, and 
volatility. These constraints have obvious analogs in high-relcon machines. 
Because satisfaction of these constraints is also obviously unalogous, we need not 
consider these constraints further. Instead we concentrate on considerations 
peculiar to high-relcon machines. 

All the testing procedures we've discussed assume independence of cell 
behavior. Gradual growth of an arm machine involves concurrent tests of an 
individual cell and its associated machine. As soon as the 'ast cell of an arm has 
been tested, the arm is complete and tested. On the other hand, high-relcon cells 
are independently tested before they're included in an embedded machine. Testing 
an embedded machine, or its modules, checks our independence assumptions. An 
embedded machine may be tested like any digital machine, via its inputs and 
outputs. Furthermore, test-link capability provides testability to high-relcon array 
machines that's not available in ordinary digital machines. Test links may connect 
an embedded machine's module with a test machine, to allow independent testing 
of that module. A test link, terminated by a transmission-branch cell, may be used 
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as a probe which, at a given time, monitors the signals on a wire in an embedded 
machine. After test linits are used for module testing or probing, they can be 
withdrawn. Of course, this assumes that the test arms do not affect the operation 
of the eventually embedded machine; this is a safer assumption than a simple ceil- 
independence assumption. 

The use of cells as wires in high-relcon machines necessitates special 
considerations. In most hard-wired machines, it*s safe to disregard the delay 
through wires; but this assumption is usually not valid in high-essential machines 
because the delay through a wire cell is close to the delay through some other 
cell. A pair of essential neighbors may be linked by different-length wires in 
different flawed arrays. Wire delays consequently decrease an embedded 
machine's maximum operation speed Furthermore, they compound the "critical 
race" problem, thereby making array machine designs more constrained than non- 
array designs. A synchronous high-relcon machine must be clocked slowly enough 
to allow for the delay through embedded wires. Other conventional techniques for 
solving timing problems, such as ready-acknowledge signalling, may be employed 
where needed for communication between modules in embedded machines. 

Array machines compensate for inherent limitations by providing added 
capabilities, including automation-compatibility. We've seen that a simple array 
facilitates testing and repair by its iterative nature, and by the fact that test and 
repair facilities are built into a cell. An array's simple structure also facilitates 
computer-aided design. A designer could specify a machine as a perfect 
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embedded machine with timing constr^s ;0n Jts j^9>, %iH pftofffm^^^d 
check that an envisioned erobeddiTp.«siiU»fied t|)ei9e i;oii»lreints. A more 
sophisticated program coiid "compite" a ro«:^ne> hi||}rJevd-lanpu««e 8p#dfication 
into an acceptable array-embedded macNne; this if difHpuit, but easier than 
analogous con^utar-MdeKi design in a less regiiir f^pripfner^^ 
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S«ctf Of) 4.7: Higfv-rafoon Machine Applkitions 

Wi*v# difcusMd our qsprii^h t6 tKo milt dfHoilt tttttng and repair 
processes for a checkerboard array - tt^tfnent^bf Idi^ri^on rnaeNhes. Abitract 
description of ah essential maehfne as an essentia) ileiwoH(fotiisi» on the 
properties of a machine that are impdrtant to Mil a^ ripill i^^ that 

high-relcon machines have higher testing and repair costs than arm macMnea. 
WeVe also shown that, even for hiiM'cricm madtines, our cellulw epproech offers 
major integration, test, and maintenance advmtages relative to other metiwcte for 
system implementatioa in tNs secticm we cmmkur applications mwits of Mgh- 
relcon machines, relative to arm macNnes and non-array machines. We discuaa the 
General eel) as one which enables realization of the benefita of Mgh^relcen 
machines. 

Chapter 2 discussed the general advantages of cellular arrays, and 
argued for our amy approach. TMs appro«:h attempts to meet systwn design, 
production, and maintenance needs through standard, high-volume, flex)t>)e, 
automation-oriented modules - cells and associated programs. Vl% compared our 
approach to other, less constrwned approaches. QM|>ter 3 dscussed belwwed erm 
machines using our approach. Earlier sections of this chapter compared testing and 
repair processes for arm and Ngh-relcon machines. This section highiighta 
performance features that haven't been sufficiently covered. 

Because the communication paths between ceils in a Ngh-rdcon macNne 
are leas constrained than those In an arm maehtne, a Ngh-releon machine providea 
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speed and flexibility adyantaiBes In ««rtaUi ppplif aWpii*, Tli© fuitabWty of a 
particular type of wtenUal mact^,^(ifp«ii5)l «ig^^^ i^ m i^n df^m 4>f 

direct, simultaneous inter-cell corninMi|t<:i»t^m ^oserW^ ••««»- 

out shift-register is well-suited to arrn.tJi^nevbfc^ of tho 

register communicates directly with at mft|t 4wo jRthef^:^i|iw». , Tuf^^michinM ar« 
well-suited to macNnes whiich hf^y© only j^ i|«|ioi[fcjM*1H«^ glvgri tifflo; 

therefore random-access and sofne otiw,pp)iiop aro^w^ i^izatien 

as tree machines. In an arm macNne, essentia) neigfibprs v «iw«ys in fKl^ont 
cells. In a g;rld embedded in a flawed array, e«8i|n|ii|l neighbors aren't necessarMy 
In adjacent cells; this diminishes the fpeed advaatH« of th« emboddtd rid 
machine. Hlgh-relcon machine realj^litlon is purtiiaitiirty sutted to nwcWnM 
composed of modules wWch conpw^ate (#j(l«ce»rt ln^>^^ 
other modules at the same time. Suc^ niac|in«s^ ip^t ri^^ 
even awkwardly share the coriwHjnicftUonp^jija^ frm m^tm—. 

For instance, building a processor as jn^ ar»n or .t^ipaiaitee JA^euid ir^^ 
require complex cells, an^ suffer from low paraUeljwit J^or^ipftll emachinv't 
extra-array leads to connect to one cell nf«^8|ft|itt2(|tJon o^ very 

difficult. Thus high-relcon arrays provid« f^tj(m^ inf|Drfn8|k>n|M|N. b^ 
higher testing and rep^r cwts when th^ are usjid for *iMfe;!rolcon^^i^^ A 
high-relcon array is most suited to machlnits ivifWdi^fispWI; U^^^ 
information paths, such as the processor-tester-repairer built of General ceils^ 

High-relcon arrays, sMcb a? Genera^, arrays, of|er major ^KNmtages as 
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pwiph»fal e<|uipment in i computer system. '^ cort^)utiriy»lem offer* the 
array a nor^volatile Ar«iy Prdgraminer; %» #Ny offers a relliWe,ir»expen8ive, 
programmable, hrgh-8pe«#prdce8tfngc^MiNHty. 

For many wachine ias^, the ^Seneri! array Is a correct compromise 
between a single-sequence computer and a special hard-wired macWne. A typical 
conf»put«r% performance advantages include computational power, flexibility, and 
easy programrrjablli^y. Its major fladvifrtaills sl^w p^formance relative to 
hard^^wiped machines. The single-sequence computer processes only one 
Instruction at a time, with ftifch lns1[ri:Ubn1aKlng inariy gate-ilelays. The 
ameHorating parallelism In some Instructions is often wasted. For instance, an 
^gorithm that only operates on 1-bit words still lises an AND that operates on 
larger words. The conventional computer is particularly ill-suited to Irregular or 
Wgh-frequency real-time applications; hancjling incoming signals through Interrupts 
is parlicularly ttme-consuming and tricky. Computers are so clumsy at real-time 
applications that they often rely on a hard-wirdd machine to buffer incoming 
signals; this macWne continuously monitors, collects, and pre-processes Incoming 
data. ^^«ny apj^tuons areinore sW^edto e l^lal^ which 

offers higlwr speed; Ksadvantag^ ef iuli am«KJ*» incfud^ times and 

high^tup«osts, espe^fy if th<^e costs we n^ dWilibutii'ovw number 

of «»»ehine«. Testing «fid repair ««tlw*eif«iihlr»rt3i^ end 

costly. 

A peripheral array, such as the General array, is a compromise between 
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the performances of these two most common machine approaches. The array may 
be quickly and easily programmed to one of a large set of embedded machines. 
For Instance, a processing-intensive problem could be solved in an array which 
interrupted the computer's processor when it had solved the problem. Alternately, 
a General array could be used as a processor component tailored to the 
requirements of a particular processing task. The array provides a high degree of 
potential parallelism. Basic cell operations, those that occur in the cell's function 
states, are faster than basic computer operations, but slower than the basic 
operations of a special-purpose hard-wired machine. Like a hard-wired machine, a 
General machine can continuously monitor and process incoming signals. 
Furthermore, our arrays have the added advantages of low cost and easy, 
automatic maintenance. 

Of course, an array's suitability depends on its intended application are& 
The General array is oriented toward narrow data words; there is only one 
processing input in each of a cell's side-sets. Parallel algorithms, especially those 
amenable to two-dimensional array solution, are particularly appropriate for the 
General array. Many physical problems, such as temperature distribution on a 
plate, are consistent with such an array solutioa Such an array might benefit from 
larger processing side-sets to accomodate numbers representing one of a wide 
range of terhperatures. However, such a macro cell with large side-sets could be 
built of General cells. The General array is good at logic simulation. Real-time 
applications which would otherwise require an expensive, low-volume special 
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machine are often suited to a high-relcon array. 

An array's utility depends on its size. This partially explains our 
interest in array repair. One way to increase an array's size is to interconnect 
arrays. If one's objective is a large array with a checkerboard array's 
interconnection network, one must currently make many interconnections between 
neighboring arrays. This is fairly costly, even if one uses a special interconnect- 
array circuit board, because of the many IC leads involved Our approach reduces 
the need for many leads between sub-arrays by relying on testing and loading 
arms, and by Repair's block orientation. This block orientation recognizes that 
most machines are composed of modules, and have few communication paths 
between the modules. 

A high-relcon array may also replace special-purpose machines In a 
computer system. Here the array is most appropriate when computer- 
maintainability is important. 

The ability of an array-embedded machine to test, program, and repair 
its cellular environment is particularly attractive. Such a machine can form its 
cellular environment into machines appropriate to a given application at a given 
time. Two or more machines like the one we've designed can achieve high 
relibability by monitoring and repairing each other. Each machine is embedded in a 
sea of spare parts, cells, with enough cells to support many cell failures. When 
one machine notices that the other has failed, it re-tests the other's environment 
before embedding a new, perfect machine. With three array-embedded machines, 
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a first good machine may continue normal operation while the second good one 
repairs the faulty machine. 

It's amusing to consider the unlikely event of a form of "array cancer", in 
which a faulty machine attempted to wipe out a properly working machine. Each 
embedded machine could guard against inappropriate attack with test arms for 
noticing attack, and a loader arm for fighting the attack. A machine could be 
programmed so that both its defense and attack programs required proper use of 
all the machine's processor sections. With the right attack and defense programs, 
a perfect machine should then be able to dominate a malicious, faulty machine. 

If the General array is non-volatile or easily backed up by a power 
supply or loading source, it may be mass-produced and program-customized to 
provide inexpensive, low-volume machines inappropriate to microprocessor 
realization. Sometimes added advantages come from the machine's nature as a 
standard part that can be tested, programmed, and repaired through limited 
communication with a standard machine. Our arrays can even be repaired by a 
remote machine connected to an array via communication links. 
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CHAPTER 5: TREE MACHINES 

This chapter discusses embedded machines with a particularly simple 
nature. All cells of a tree machine are effectively linked to a common input bus 
and common output bus. Each cell is a balanced, essential cell whose function 
state includes a unique name. At any given time, only one cell may transmit its 
information out of the embedded machine. Examples of such a machine are paged 
random-access and track-addressed sequential-access memories, with one cell per 
page or track. The embedded machine's simplicity means that a cell's processing 
layer can be designed so that all tree-like relcon networks with a given number of 
nodes may correspond to equivalent embedded machines. This allows efficient use 
of good cells in a flawed array, because an embedded machine can incorporate any 
good cell linked to its input-output (tree base) cell by some path of good cells. 
Because one form of tree is an arm, a flawed array embedding a tree machine can 
be repaired at least as efficiently as a corresponding array embedding an arm 
machine. Furthermore most large, flawed arrays may be repaired to embed 
random-access memories or other tree machines with average access time 
proportional to the square-root of the number of cells in the array. 

For specificity, we consider a paged random-access memory 
implementation with the following characteristics. The RAM has 2^ pages, or celts, 
with 2** words of length L in a RAM on each page. Command and output words are 
handled serially. The RAM has two input lines called Klock and Command, and one 
output line called Return. When the RAM is ready to receive a command 
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specifying a "Read" or "Write" operation, the Command stream Klocked into the 
RAM specifies the following: 

1) a p-bit page address which selects the one cell of the embedded 
machine with the identical name stored as p function-specification state 
bits; 

2) a w-bit address selecting a particular word within the page; 

3) a Read/write bit specifying either a "Read" or "Write" operation; 
and 

4) if the command is a "Write", the L-bit word to be written. 

If the command is "Read", the L Klock pulses after the commund Klock the selected 
word out of the embedded machine. 

Since the paged-RAM cell's loader is the same loader detailed 
previously, we focus on a balanced processing mechanism and associated function- 
specification state bits for cells in a checkerboard array. Each of a cell's side-sets 
has one Insel input line specifiying whether that side-set is relected to send Klock 
and Command information directly into the cell. A ceil in a working embedded 
machine has only one of its Insel inputs high. The Insel-selected Klock and 
Command information is broadcast to the cell's neighbors via the cell's Klock and 
Command Outputs. A cell's broadcast Return output is that cell's RAM Output line 
if the cell has been addressed; otherwise the Return output is the OR of from 
zero to three Return inputs selected by four Retsel function-specification state 
bits. Each Retsel state bit corresponds to one of a cell's side-sets. Besides 
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Fig. 5.1 Relcon Networks For RAMs In Identical Rawed Arrays 



A) Relcon network for one embedded RAM 
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B) An embedded RAM with better access time than A 
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Commands input to a tree's base flow to the tips of the tree. Every link 
that carries an input command in one direction carries a Return in the 
opposite direction. An addressed cell's Output information successfully 
reaches the embedded machine's output because the Output is ORed 
with Os as it flows to the embedded machine's output. Maximum access 
time is minimized by minimizing the longest path between a tree-tip and 
the tree's base. Machine B*s M:cess time is better than A's because A 
has a circuitous path to node (3 0). The boat expected access time 
results from placement of a tree's base at the ciiiter of Its associated 
array. 
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determining whether a cell's left Return input is selected to be ORed, the "left" 
Retsei state bit is also the cell's left Insel output. A corresponding statement is 
true for a cell's "right", "up", and "down" Retsei state bits. Thus an embedded 
machine is organized so that cell A accepts a Return input frsm celt B If and only if 
cell B accepts Klock and Command inputs from cell A. Input command information 
enters a cell from one of its neighbors, and is accepted by up to three of Its other 
neighbors. Hence a given RAM with c cells can be realized by any tree-like relcon 
network of c good cells consistent with the limits imposed by an array's 
interconnection network. Figure 5.1 shows relcon networks for two equivalent 
machines in identical flawed arrays. The machines differ only In their access time. 

In a checkerboard array, access time is minimized by placing a tree's 
base cell at the center of a square region of cells; one diagonal of the square is a 
row, the other is a column, and the diagonals cross at the tree base cell. When 
such a strategy is used, the expected time required to senc information to or from 
the tip of a tree embedded in a flawed array is proportional to the square-root of 
the number of tree cells. Expected access time is therefore proportional to the 
square-root of the number of cells in large tree machines. In an n-dimensional 
array, this expected access time is proportional to the "n"th root of the number of 
tree cells when the tree's base is at the center of a cube o" hyper-cube. 

Since the cell we've discussed handles information serially, it needs a 
counter and associated circuitry to coordinate activity. This counter is initialized 
by the loader. 
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Techniques used for improving perfm'mance of convmticmai RAMs, such 
as use of parity bits, are applicable to this approach. The RAM in each cell Is 
identical to conventional RAMs. The fact that a loading arm can rapidly shufWe the 
names of cells in a macNne without disturbir^ their RAM contents may be useful 
for some systems' memory management If a simple pai^ng ^stem is willing to 
effectively construct a page table by shuffling the namet of memory cellt, a special 
page table and its associated delays are not required. 

Test and repair of flawed arrays en^»eckfing tree macNnes is similar to 
test and repair for arm machines. A tree machine is grown cell-by-cell Into the 
area arourwl its base, and each extension is monitored by communication between 
the tree's base and the Array Programmer. A cell in an embedded machine is 
always linked to the base cell by the shortest possible relcon path, and given a 
unique name. A working ceil in an embedded macNne ignores inputs from flawed 
cells and dangling array inputs. 

Packaged memories are easily formed into larger memories by providing 
a few links between packages to allow growth of the tree through all the 
packages. The number of cells In the tree is only constriHned by the required 
access time and the number p of page-address bits in each ceH. 

Overhead circuitry coirfd be reduced by using triangular arrays instead 
of checkerboard arrays, if this was compatible with the production process. 

It's obvious that this approach is appiicabte to any machine which may 
be realized as a tree machine. Inputs and outputs to such a machine could be 
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parallel rather than serial. One could Implement a mai^'^trackecl sequentid-access 
memory, with one cell for each track. Associative memories and even some multi- 
processor systems (similar to the ETHER system) are compatible with this 
approach. 

These tree machines further evidence the fact that relaxing the 
requirements on the communication paths between essential cells in an embedded 
machine facilitates repair efficiency. 



4 
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CHAPTER 6: CONCLUSION 

This thesis has presented an LSI-oriented systems approach to test, 
configuration, and repair of cellular arrays. We've specified standard modules that 
are built into the cells of a machine to facilitate testing, loading, and repair. Thus 
the mechanisms for testing and customizing a flawed array are built into a simple, 
iterated part. A computer may access these mechanisms via a few direct 
connections to an array. Programs allow the computer to maintain or re-customize 
the array. We've been careful to note our assumptions, and to discuss design 
approaches that help insure the validity of these assumptions in actual arrays. 

Development of terminology and models for programmable logic machines 
has helped us analyze important machine classes; these are arm, high-relcon, grid, 
and tree machines. A particular class of machine is characterized by the 
requirements placed on the communication paths between essential cells of any 
embedded machine in the class. A particular embedded machine is associated with 
a set of equivalent embedded machines. The nature of this set affects the 
testability and repairability of an array. Properties of a cell, such as balance, 
affect an embedded machine's structure and associated equivalence class; and 
therefore affect the repairability of an array. 

There are reasonable practical and theoretical extensions of this work. 
We believe that tying further theoretical inquiry to actual machine realization goals 
will be most productive. 
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Tree and arm machines seem particularly^, syited to jmmediate array 
realizatioa These machines are relatively simple tp aMipmaticalty teet, repair, and 
re-customize. Furthermore, their reiatiydy I<nk cirfl-^^ftultry overhead and Ngh 
repair-efficiency IP ve them major integration-ievel advantages. 

Arm or tree machines may first be conctru|:ted in a system containing 
many ICs. Such a system wogld enable further expi^r^tion and demonstration of 
the feasibility of our approach. The roa^or advantafOf of a many-IC system, 
compared to a system intei^ated on one slice, are its low development cost and 
N|h component accessibiiity. Such a system should be able to function when some 
of its ICs are removed or destroyed^ and some of Its wlret are cut The mary-IC 
system would enable us to refine our deslr«s and our test and repir progreme. 
The major limitation of a man^-IC research veMcfe la th§t It djoesnU preciaely 
model our ultimate goat, a cellular ^tem iritiirated on one fiaw«d tlice. 

Besides its obvious value as a ^stem coinponen'^ a singl«-slice tree or 
arm macNne would help answer important. questions relevant, to other arrays. How 
accurately do our assumptions model actual co^tl^^ on f flfwed allct? How 
significant is the branch cell problem? How do po^irer siflM^fi hfat diiiaipation, 
array size, and other practical considerations affect array implementation? 
Answers to these questions will deperid partly on the engineering skill end 
production care of array devdopers. Since arm and tree mecNnes are simpler to 
implement than many other programmable 1(^ nifchin^ |^i|^ tp inpkirnent theee 
arrays is a sjne qua non for practicality of many pUprprop^e^ arrays. 
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Arrays like General are the most exciting, because of their use of 
simple cells acting in parallel to provide universal computation-construction-repair 
capabilities. These arrays offer speed, reliability, and flexibility advantages in a 
low-cost integrated circuit. Current IC densities and yields probably don't allow 
practical realization of these large arrays. However, densities and yields are 
improving so rapidly that these arrays should be feasible before 1980. By then, 
many questions pertinent to these arrays should have been solved for tree and 
arm arrays. Continued work on testing and repair, and development of computer- 
aided design facilities for these arrays, will be important to their commercial 
success. Consideration should be given to the machine organizations most suited 
to high-relcon machines. 

The first use of our approach to high-relcon machines may be In many- 
IC arrays of fairly complex machines, such as microprocessors. This is true 
because these arrays are closer to conventional digital systems. Unfortunately, 
the fact that such arrays have relatively low basic operation speed compared to 
General means they don't use high-relcon arrays to full advantage. Nevertheless 
we've seen the advantages of building simple test and repair mechanisms Into an 
iterated component. 

Since our test, configuration, and repair techniques may be adapted to 
existing arrays, it would be useful to categorize these arrays according to their 
realizability as an arm, tree, high-relcon, or other class of embedded machine. 

Other inquiries may take numerous directions. A more rigorous 
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treatment of our testing atsumptlmt tncl approach would be useful. Many 
questjont remain eonceming the repair el chadia^b^^ iirayt tJMt embed Ngh- 
reicon machines. These questions concern the best way to repair these arrays, 
and the limits of this repair. A better uiidirstaR(;^.9f,^refwir wW aUpw better 
estimates of the reliability and maintainability of Ngh-relcon macNnes. The use of 
a plurality of high-relcon machmes in a letl-rfplric^ l^tem aheuU i>f e^^ored. 
The reliability and mikitaif^iHty tevi^t thft .dii^ fcbleved by our yarioua 
machines should be compared to the levels acNeved by other machines. The 
network models and terminoloiy weVe devf^^ lor ef||biGi^ iB^ be 

refined, and new niachine classes can be idfntifi^ a^stMG|«d. Our trentnient of 
testing and repair for checkerboard arrays can be Mctended to arrays w»th other 
interconnoction networks. 
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